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Method of Controlling Insect Pests 

The present invention relates to a method of controlling species of the Lepidoptera genus 
Ostrinia species preferably Ostrinia fumacalis (Asian Com Borer), in crop plants by use of 
toxin proteins obtainable from Bacillus thuringiensis and/or other Bacillus species. 

Bacillus thuringierisis belongs to the large group of gram-positive, aerobic, 
endospore-forming,bacteria. Unlike other very doselyTefeted species of Bacillus sueh as 
B. cereus or B. arythracis, the majority of the hitherto known Bacillus thuringiensis species 
produce in the course of their sporulation a parasporal inclusion body which, due to its 
crystalline structure. Is generally referred to as a crystalline body. This crystalline body is 
composed of insecticidally active crystalline protoxin proteins, the so-called 8-endotoxins. 

The protein crystals are responsible for the toxicity to insects of Bacillus 
thuringiensis. The 5-endotoxin does not exhibit its insecticidal activity until after oral 
ingestion of the crystalline body, when the latter is dissolved in the intestinal juice of the 
target insects. In most cases the actual toxic component is released from the protoxin as 
a result of proteolytic cleavage caused by the action of proteases from the digestive tract 
of the insects. 

The 6-endotoxlns of the various Bacillus thuringiensis strains are characterized 
by high specificity toward certain target insects, especially with respect to various 
Lepidoptera, Coleoptera and Diptera larvae, and by a high degree of activity against such 
succeptible larvae. A further advantage of Bacillus thuringiensis 8-endotoxins resides in 
the fact that the toxins are hamiless to humans, other mammals, birds and fish. 

The various insecticidal crystal proteins from Bacillus thuringiensis have been 
classified based upon their spectrum of activity and sequence similarity. The 
classification put forth by H6fte and Whiteley. Microbiol. Rev. 53: 242-255 (1989) placed 
the then known insecticidal crystal proteins into four major classes. Generally, the major 
classes are defined by their spectrum of activity, with the Cry! proteins being active 
against Lepidoptera, Cryll proteins against both Lepidoptera and Diptera, Crylll proteins 
being active against Coleoptera, and CrylV proteins against Diptera. 
Within each major class, the 6-endotoxins are grouped according to sequence similarity. 

The Cryl proteins are typically produced as 130-140 kDa protoxin proteins which 
are proteolylically cleaved to produce insecticidally active toxin proteins about 60-70 kDa 
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in Size. The active portion of a 6-endotoxin resides in the NH^-terminal portion of the full- 
length molecule. Hofte and Whiteley. supra, classified the then known Cryl proteins into 
six groups. IA(a). IA(b). IA(c). IB. IC. and ID. Since then, proteins classified as CrylE, 
CrylF, CrylG, CrylH and CrylX have also been characterized. 

The spectrum of Insecticidal activity of an individual ^-endotoxin from Bacillus 
thuringiensis tends to be quite narrow, with a given 5-endotoxin being active against only 
a few insects^Specificity is the result of the efficiencirof the various steps involved in 
producing an active toxin protein and its subsequent ability to interact^with the epithelial 
cells in the insect digestive tract. 

It is one of the objects of this invention to provide a method of controlling Ostrinia 
furnacalis (Asian Com Borer) species in plants, preferalbly cereal crops, including, but 
not limited to the species of maize, wheat, rye, oats. rice, sorghum, millet and related 
crops, forage grasses, bamboo and sugar cane. This object could surprisingly be 
achieved within the scope of the invention by administering a toxin protein of Bacillus 
thuringier)sis such as a Cryl-type toxin protein, to the crop plant to be protected. In 
another embodiment of the invention toxin proteins obtainable from vegetative cultures of 
Bacillus species, so-called Vegetative Insecticidal Proteins (VIPs) such as VI P3 [EP-A 0 
690 916; International Application no EP95/03826, the disclosure of which is 
incorporated herein by reference in its entirety), can also be used to control Ostrinia 
furnacalis {Asian Corn Borer) pests. 

The present invention thus relates to a method for protecting plants including 
progeny thereof against damage caused by Ostrinia furnacalis {Asian Corn Borer) 
species comprising directly or indirectly administering to the plant or the plant seed or the 
growing area of the plant to be protected a toxin protein of Bacillus species, preferably a 
Cryl-type or a VIP-type protein mentioned above, either purely or in the form of an ento- 
mocidal composition comprising at least one of said proteins or a microorganism, 
preferably a Bacillus thuringiensis and/or a Bacillus cereus strain, containing at least one 
toxin gene encoding the toxin protein. Said microorganisms used in the metho^d 
according to the invention may either be naturally occurring strains or, in the alternative, 
recombinant strains comprising a recombinant gene encoding the toxin. 

in a preferred embodiment, transgenic plants are used to administer the toxin to 
the plants to be protected against damage caused by Ostrinia furnacalis {Asian Corn 
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Borer) species. Such plants are obtained by transformation with a toxin gene encoding 
an insecticidal toxin protein from a Bacillus species such as a Cry-type, preferably a Cryl- 
type toxin protein or a VIP-type protein, and expressing said toxin protein in an amount 
sufficient to provide control against Ostrinia fumacalis {Asiar) Corn Borer) species upon 
planting the so transformed plant in an area where said insect pest occurs. 

Entomocidal compositions to be used in the method according to the invention for 
protecting crop plants against Ostrinia fumacalis {Asian Corn Borer) pests for example 
comprise as an active ingredient at least one Cry-type toxin protein, more preferred at least 
one Cryl-type toxin protein, even more preferred at least one CrylA-type toxin protein, 
particularly pretended at least one CrylA(b)-type toxin protein and most particularly preferred 
at least one crylA(b) type toxin protein according to SEQ ID NOS: 53 to 55. even more 
preferred of Bacillus thuringiensis or a microorganism containing at least one gene 
encoding said toxin protein, preferably a Bacillus thuringiensis stram containing at least one 
gene encoding said toxin protein, or a derivative or mutant thereof, together with an 
agricultural adjuvant such as a carrier, diluent, surfactant or application-promoting adjuvant. 
The active ingredient contained in the entomocidal composition may also be a VIP-type 
toxin protein as disclosed in EP-A- 0 690 916 and the PCT International Application No 
EP95/03826 or a combination of Cryl-type and VIP-type proteins. Preferred within the scope 
of protection is aVIPI-type protein, such as a VIP1 A{a) protein or a VIP1 A{b) protein, or a 
VIP2- type protein, such as a VIP2A(a) protein or a VIP2A(b) protein or a combination of a 
VIPl-type protein and a VIP2-type protein or aVIP3-type protein, such as a VIP3A{a) protein 
or a VIP3A(b) protein. 

More preferred within the scope of protection are VIP-type toxin proteins as shown in SEQ 
ID NOS: 1 , 2, 4-7, 17-24. 26-32. 35, 36. 39. 40. 42. 43. 45. 46, 49. 50, 51 or 52. 

The composition may also contain a further biologically active compound. Said 
compound can be both a fertilizer or micronutrient donor or other preparations that 
influence plant growth. It can also be a selective herbicide, insecticide, fungicide, 
bactericide, nematicide, molluscide or mixtures of several of these preparations, if 
desired, together with further agriculturally acceptable carriers, surfactants or 
application-promoting adjuvants customarily employed in the art of formulation. Suitable 
carriers and adjuvants can be solid or liquid and correspond to the substances ordinarily 
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employed in formulation technology, e.g. natural or regenerated mineral substances, 
solvents, dispersants, wetting agents, tackifiers, binders or fertilizers. 

The composition may comprise from 0.1 to 99% by weight of the active ingredient, 
from 1 to 99.9% by weight of a solid or liquid adjuvant, and from 0 to 25% by weight of a 
surfactant. The active Ingredient or the composition containing said active ingredient, 
may be administered to the plants or crops to be protected together with certain other 
insecticides or chemicals (1993 Crop Protection Chemicals Reference, Chemical and 
Pharmaceutical Press, Canada) without loss of potency. It is compatible with mosfoTher 
commonly used agricultural spray materials but should not be used in extremely alkaline 
spray solutions if a Cryl-type toxin is involved. It may be administered as a dust, a 
suspension, a wettable powder or in any other material form suitable for agricultural 
application. 

The active ingredient, that is preferably a Cryl-type toxin protein of Bacillus 
thuringiensis and/or one of the VIP-type proteins mentioned previously, or the 
composition comprising said active ingredient may be applied to (a) an environment in 
which the insect pest may occur, (b) a plant or plant part in order to protect said plant or 
plant part from damage caused by an insect pest, or (c) seed in order to protect a plant 
which develops from said seed from damage caused the pest. 

A preferred method of application in the area of plant protection is application to 
the foliage of the plants (foliar application), with the number of applications and the rate 
of application depending on the plant to be protected and the risk of infestation by the 
pest in question. 

The compositions to be used in a method according to the invention are also 
suitable for protecting plant propagating material, e.g. seed, such as fruit, tubers or grains, 
or plant cuttings, from insect pests. The propagation material can be treated with the 
formulation before planting: seed, for example, can be dressed before being sown. The 
active ingredient of the invention can also be applied to grains (coating), either by 
impregnating the grains with a liquid formulation or by coating them with a solid formulation. 
The formulation can also be applied to the planting site when the propagating material is 
being planted, for example to the seed furrow during sowing. The invention relates also to 
those methods of treating plant propagation material and to the plant propagation material 
thus treated. 
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Within the scope of the invention the compositions may be applied in any method 
known for treatment of seed or soil with bacterial strains. For example, see US Patent 
No.4,863.866. The strains are effective for biocontrol even if the microorganism is not living. 
Preferred is, however, the application of the living microorganism. 

Target crops to be protected within the scope of the present invention are those 
that are host plants tor Ostrinia fumacalis {Asian Corn Borer) species and include but are 
not limited to the species of maize^ wheat, barley, rye, oats, rice, sorghum, millet and 
related crops, forage grasses, bamboo and sugar cane. 

The active ingredient according to the invention may be used in unmodified form or 
together with any suitable agriculturally acceptable carrier. Such carriers are adjuvants 
conventionally employed in the art of agricultural formulation, and are therefore 
formulated in known manner to emulsifiable concentrates, coatable pastes, directly 
sprayable or dilutable solutions, dilute emulsions, wettable powders, soluble powders, 
dusts, granulates, and also encapsulations, for example, in polymer substances. Like 
the nature of the compositions, the methods of application, such as spraying, atomizing, 
dusting, scattering or pouring, are chosen in accordance with the intended objective and 
the prevailing circumstances. Advantageous rates of application range from about 50 g to 
about 5 kg of active ingredient (a.i.) per hectare ("ha", approximately 2.471 acres), and 
preferably from about 100 g to about 2 kg a.i./ha. Preferred rates of application are 200 g 
to about 1 kg a.i./ha or 200 g to 500 g a.lVha. 

For seed dressing advantageous application rates range from 0.5 g to 1000 g a.i. per 
100 kg seed, preferably from 3g to 100 g a.i. per 100kg seed. Most preferred are 
application rate from 10 g to 50 g a.i. per 100 kg seed. 

Suitable carriers and adjuvants can be solid or liquid and correspond to the substances 
ordinarily employed in formulation technology, e.g. natural or regenerated mineral 
substances, solvents, dispersants. wetting agents, tackifiers, binders or fertilizers. The 
formulations, i.e. the entomocidal compositions, preparations or mixtures thereof with other 
active ingredients, and, where appropriate, a solid or liquid adjuvant, are prepared in known 
manner, e.g., by homogeneously mixing and/or grinding the active ingredients with 
extenders, e.g., solvents, solid carriers, and in some cases surface-active compounds 
(surfactants). 



wo 97/46105 



PCT/EP97/02737 



6 - 



Suitable solvents are: aromatic hydrocarbons, preferably the fractions containing 8 
to 12 carbon atoms, e.g. xylene mixtures or substituted naphthalenes, phthalates such as 
dibutyl phthalate or dioctyl phthalate. aliphatic hydrocarbons such as cyclohexane or 
paraffins, alcohols and glycols and their ethers and esters, such as ethanol. ethylene glycol 
monomethyl or monoethyl ether, ketones such as cyclohexanone, strongly polar solvents 
such as N-methyl-2-pyrrolidone. dimethylsulfoxide or dimethylformamide. as well as 
vegetable oils or epoxidised vegetable oils such as epoxidised coconut_pil or soybean oil; or 
water. - ^ . 

The solid carriers used. e.g.. for dusts and dispersible powders, are normally natural 
mineral fillers such as calcite. talcum, kaolin, montmorillonite or attapulgite. In order to 
improve the physical properties it is also possible to add highly dispersed silicic acid or 
highly dispersed absorbent polymers. Suitable granulated adsorptive carriers are porous 
types, for example pumice, broken brick, sepiotite or bentonite; and suitable nonsorbent 
carriers are materials such as calcite or sand. In addition, a great number of pregranulated 
materials of inorganic or organic nature can be used. e.g. especially dolomite or pulverized 
plant residues. 

Depending on the nature of the active ingredients to be formulated, suitable surface- 
active compounds are non-ionic, cationic and/or anionic surfactants having good 
emulsifying, dispersing and wetting properties. The term "surfactants" will also be 
understood as comprising mixtures of surfactants. Suitable anionic surfactants can be both 
v/ater-soluble soaps and water-soluble synthetic surface-active compounds. Suitable soaps 
are the alkali metal salts, alkaline earth metal salts or unsubstituted or substituted 
ammonium salts of higher fatty acids (C sub 10 -C sub 22). e.g. the sodium or potassium 
salts of oleic or stearic acid, or of natural fatty acid mixtures which can be obtained, e.g. 
from coconut oil or tallow oil. Furjher suitable surfactants are also the fatty acid 
methyltaurin salts as well as modified and unmodified phosphoiipids. 

IWIore frequently, however, so-called synthetic surfactants are used, especially fatty 
sulfonates, fatty sulfates, sulfonated benzimidazole derivatives or alkylarylsulfonates. The 
fatty sulfonates or sulfates are usually in the forms of alkali metal salts, alkaline earth metal 
salts or unsubstituted or substituted ammonium salts and generally contain a C sub 8 -C 
sub 22 alky! radical which also includes the alkyi moiety of acyl radicals, e.g. the sodium or 
calcium salt of lignosulfonic acid, of dodecylsulfate. or of a mixture of fatty alcohol sulfates 
obtained from natural fatty acids. These compounds also comprise the salts of sulfuric acid 
esters and sulfonic acids of fatty alcohol/ethylene oxide adducts. The sulfonated 
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benzimidazole derivatives preferably contain 2 sulfonic acid groups and one fatty acid 
radical containing about 8 to 22 carbon atoms. Examples of alkylarylsulfonates are the 
sodium, calcium or triethanolamine salts of dodecylbenzenesulfonic acid, 
dibutylnaphthalenesulfonic acid, or of a naphthalenesulfonic acid/formaldehyde 
condensation product. Also suitable are corresponding phosphates, e.g. salts of the 
phosphoric acid ester of an adduct of p-nonylphenol with 4 to 14 moles of ethylene oxide. 
Non-ionic surfactantare preferably polyglycol ether derivatives of aliphatic or cycloaliphatic 
alcobots, or saturated or unsaturated fatty acids and alkylphenols. said derivatives 
containing 3 to 30 glycol ether groups and 8 to 20 carbon atoms in the (aliphatic) 
hydrocarbon moiety and 6 to 18 carbon atoms in the alkyi moiety of the alkylphenols. 

Further suitable non-ionic surfactants are the water-soluble adducts of polyethylene 
oxide with polypropylene glycol, ethylenediaminopolypropylene glycol and 
alkylpolypropylene glycol containing 1 to 10 carbon atoms in the alkyI chain, which adducts 
contain 20 to 250 ethylene glycol ether groups and 10 to 100 propylene glycol ether groups. 
These compounds usually contain 1 to 5 ethylene glycol units per propylene glycol unit. 
Representative examples of non-Ionic surfactants are nonylphenolpolyethoxyethanols, 
castor oil polyglycol ethers, polypropylene/polyethylene oxide adducts, 
tributylphenoxypolyethoxyethanol, polyethylene glycol and octylphenoxypolyethoxyethanol. 
Fatty acid esters of polyoxyethylene sorbitan. such as polyoxyethylene sorbitan trioleate, 
are also suitable non-ionic surfactants. 

Cationic surfactants are preferably quaternary ammonium salts which contain, as N- 
substituent. at least one C sub 8 -C sub 22 alkyI radical and. as further substituents. lower 
unsubstituted or halogenated alkyl. benzyl or hydroxyl-lower alkyI radicals. The salts are 
preferably in the form of halides, methylsulfates or ethylsulfates, e.g., 
stearyltrimethylammonium chloride or benzyldi-(2-chloroethyl)ethylammonium bromide. 

The surfactants customarily employed in the art of formulation are described, e.g.. in 
"McCutcheon's Detergents and Emulsifiers Annual", MC Publishing Corp. Ridgewood, N.J., 
1979; Dr. Helmut Stache. "Tensid Taschenbuch* (Handbook of Surfactants). Carl Hanser 
Verlag, Munich/Vienna. 

Another particularly preferred characteristic of an entomocidal composition of the 
present invention is the persistence of the active ingredient when applied to plants and soil. 
Possible causes for loss of activity include inactivation by ultra-violet light, heat, leaf 
exudates and pH. For example, at high pH. particularly in the presence of reductant. 8- 
endotoxin crystals are solubilized and thus become more accessible to proteolytic 
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inactivation. High leaf pH might also be important, particularly where the leaf surface can 
be in the range of pH 8-10, Formulation of an entomocidal composition to be used in a 
method according to the present invention can address these problems by either including 
additives to help prevent loss of the active ingredient or encapsulating the material in such a 
way that the active ingredient is protected from inactivation. Encapsulation can be 
accomplished chemically (McGuire and Shasha. J Econ Entomol 85: 1425-1433, 1992) or 
biologically (Barnes and Cummings, 1986; EP-A 0 192 319). Chemical encapsulation 
involves a process in which the active ingredient is coated with a polymer while^Wological 
encapsulation involves the expression of the 6-endotoxin genes fn a microbe. For biological 
encapsulation, the intact microbe containing the toxin protein is used as the active 
ingredient in the formulation. The addition of UV protectants might effectively reduce 
irradiation damage. Inactivation due to heal could also be controlled by including an 
appropriate additive. 

Preferred within the present application are formulations comprising living microorganisms 
as an active ingredient either in form of the vegetative cell or more preferable in form of 
spores, if available. Suitable fonmulations may consist, for example, of polymer gels which 
are crosslinked with polyvalent cations and comprise these microorganisms. This is 
described, for example, by D.R. Fravel et al. in Phytopathology. Vol. 75. No. 7. 774-777, 
1985 for alginate as the polymer material. It is also known from this publication that carrier 
materials can be co-used. These formulations are as a rule prepared by mixing solutions of 
naturally occurring or synthetic gel-fomiing polymers, for example alginates, and aqueous 
salt solutions of polyvalent metal ions such that individual droplets form, it being possible for 
the microorganisms to be suspended in one of the two or in both reaction solutions. Gel 
formation starts with the mixing in drop form. Subsequent drying of these gel particles is 
possible. This process is called ionotropic gelling. Depending on the degree of drying, 
compact and hard particles of polymers which are stnjcturally crosslinked via polyvalent 
cations and comprise the microorganisms and a carrier present predominantly uniformly 
distributed are formed. The size of the particles can be up to 5 mm. 

Compositions based on partly crosslinked polysaccharides which, in addition to a 
microorganism, for example, can also comprise finely divided silicic acid as the carrier 
material, crosslinking taking place, for example, via Ca"^"*" ions, are described in EP-Al-0 
097 571 . The compositions have a water activity of not more than 0,3. W.J. Cornick et al. 
describe in a review article [New Directions in Biological Control: Alternatives for 
Suppressing Agricultural Pests and Diseases, pages 345-372. Alan R. Liss, Inc. (1990)] 
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various formulation systems, granules with vermiculite as the carrier and compact alginate 
beads prepared by the ionotropic gelling process being mentioned. Such compositions are 
also disclosed by O.R.Fravel in Pesticide Formulations and Application Systems: 11th 
Volume, ASTM STP 1112 American Society for Testing and Materials, Philadelphia, 1992. 
pages 173 to 179 and can be used to formulate the recombinant microorganisms according 
to the invention. Further methods for formulating living microorganism are described in 
WO96/02638. 

The compositions according to the invention are valuable for preventive and/or 
curative treatment ir^ the field of pest control even at low rates of application while being 
well tolerated by and non-toxic to warm-blooded species, fish and plants and have a very 
favourable biocidal spectrum. The compositions according to the invention are active 
against all or individual development stages of Ostrinia fumacalis (Asian Com Borer) pests. 
The insecticidal action of the compounds according to the invention can become obvious 
either directly, i.e. by destroying the pests immediately or only after some time has elapsed. 

The said composition can be provided in form of a chemical mixture comprising 
the toxin proteins in an essentially pure form or in form of a mixture comprising at least one 
of the toxin proteins as part of a microorganism or a transgenic plant. 
In a specific embodiment of the invention, one of the active ingredients may be applied to 
the plant directly by, for example, leaf application as described herein previously, whereas 
the second active principle may be provided by the plant itself upon expression of a 
previously transformed gene encoding the said second principle. 

The entomocidal compositions to be used in the method according to the 
invention usually contain from about 0.1 to about 99%, preferably from about 0.1 to about 
95%. and most preferably from about 3 to about 90% of active ingredient; from about 1 to 
about 99.9%, preferably from about 1 to about 99%, and most preferably from about 5 to 
about 95% of a solid or liquid adjuvant; and from about 0 to about 25%, preferably about 
0.1 to about 25%, and most preferably from about 0.1 to about 20% of a surfactant. 
Whereas commercial products are preferably formulated as concentrates, the end user 
will normally employ dilute formulations of substantially lower concentration. The 
entomocidal compositions may also contain further ingredients, such as stabilizers, 
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antifoams, viscosity regulators, binders, tackifiers as well as fertilizers or other active 
ingredients in order to obtain special effects. 

The present invention also relates to formulations comprising living 
microorganisms as an active ingredient which are present in the form of vegetative cells 
or more in the form of spores, if available. 

A further object of the invention jeiates to the use of recombinant microorganisms 
comprising a toxin gene encoding a toxin protein of Bacillus thuringlensis such as a 
Cryl-type protein, in a method of controlling crop plants against damages caused by 
Ostrinia furnacalis (Asian Corn Borer) species, which recombinant organisms are either 
applied directly to the plant to be protected or the recombinantly produced toxin protein is 
first isolated from the recombinant microorganism and formulated as described above 
before being applied to the crop plant to be protected. The recombinant microorganisms 
may also contain a toxin gene encoding a VIP-type toxin protein as disclosed in the 
EP-A-690 916 and the International Application No EP95/03826 or a combination of genes 
encoding at least a Cry-type toxin and a VIP-type toxin, respectively. 

For recombinant production of the toxin protein in a host organism, the coding 
sequence may be inserted into an expression cassette designed for the chosen host and 
introduced into the host where it is recombinantly produced. The choice of specific 
regulatory sequences such as promoter, signal sequence, 5* and 3' untranslated 
sequences, and enhancer appropriate for the chosen host is within the level of skill of the 
practioneer in the art. The resultant molecule, containing the individual elements linked 
in the proper reading frame, are inserted into a vector capable of being transformed into 
the host cell. Suitable expression vectors and methods for recombinant production of 
proteins are well known for host organisms such as E. coll (see, e,g. Studier and Moffatt. 
J. Mol. Biol t89; 113 (1986); Brosius. DNA 8: 759 (1989)), yeast (see, e.g., Schneider 
and Guarente. Meth. EnzymoL 194: 373 (1991)) and insect cells (see, e.g., Luckow and 
Summers. Bio/TechnoL 6: 47 (1988)). Specific examples include plasmids such as 
pBluescript (Stratagene. La Jolla, CA), pFLAG (International Biotechnologies, Inc., New 
Haven. CT), pTrcHis (Invitrogen, La Jolla, CA). and baculovirus expression vectors, e.g.. 
those derived from the genome of Autographica californica nuclear polyhedrosis virus 
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(AcMNPV). A preferred baculovirus/insecl system is pVI11392/Sf21 cells (Invitrogen, La 
Jolia. CA). 

The recombinantly produced toxin protein can be isolated and purified using a 
variety of standard techniques. The actual techniques which may be used will vary 
depending upon the host organism used, whether the toxin protein is designed for 
secretion, and other such factors a skilled artisan is aware of (see, e.g. chapter 16 of 
Ausubel, F. ef a/., "Current Protocols in Molecular Biology", pub. by John Wiley & Sons. 
Inc. (1994). ~ 

A preferred object of the invention relates to the use of transgenic plants 
comprising and expressing a toxin gene encoding a toxin protein of Bacillus thuringiensis, 
especially a Cryl-type toxin protein, in an amount sufficient to provide control against 
Ostrinia fumacalis {Asian Corn Borer) species, in a method of protecting crop plants 
against damages caused by Ostrinia fumacalis {Asian Corn Borer) pests. The plants can 
be the result of nuclear transformation or plastid transformation (see WO 95/24492}. 
Especially preferred are transgenic plants expressing a CrylA(b) toxin protein of Bacillus 
thuringiensis. The invention also relates to the use of transgenic plants comprising a toxin 
gene encoding a VIP-type protein as described in EP-A-690 916 and International 
Application No EP95/03826. herein incorporated by reference in its entirety. The 
invention also relates to the use of transgenic plants comprising and expressing a toxin 
gene encoding a toxin protein of Bacillus thuringiensis, but especially a Cry-type toxin 
protein, and also comprising and expressing ^ toxin gene encoding a VIP-type protein in 
an amount sufficient to provide control against Ostrinia fumacalis {Asian Com Borer) 
species. A host plant expressing said toxin genes will have enhanced resistance to insect 
attack of Ostrinia fumacalis {Asian Corn Borer) species and will be better equipped to 
withstand crop losses associated with such attack. 

In one preferred embodiment, expression of one or more Bt 5-endotoxins in a 
transgenic plant is accompanied by the expression of one or more VIP-type proteins. This 
co-expression of more than one insecticidal principle in the same transgenic plant can be 
achieved by genetically engineering a plant to contain and express all the genes necessary. 
Alternatively, a plant. Parent 1 , can be genetically engineered for the expression of VIP-type 
proteins. A second plant, Parent 2, can be genetically engineered for (he expression of Bt 
8-endotoxin. By crossing Parent 1 with Parent 2, progeny plants are obtained which 
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express all the genes introduced into Parents 1 and 2. Particularly preferred Bt 8- 
endotoxins are those disclosed in EP-A 0618976, herein incorporated by reference. 

Also comprised by the present invention is the use of recombinant microorganisms 
or transgenic plants comprising a gene encoding DNA molecules which hybridizes to a DNA 
molecule encoding a toxin protein of Bacillus species, but preferably to an oligonucleotide 
probe obtainable from said DNA molecule comprising a contiguous portion of the coding 
sequencelor the said toxin protein at least 10 nucleotides in length, under moderately 
stringent conditions. The invention preferably comprises the use of recombinant 
microorganisms or transgenic plants comprising a gene encoding DNA molecules which 
hybridizes to a DNA molecule encoding a toxin protein of Bacillus thuringiensis or 6 cereus 
especially to a DNA molecule encoding a Cry-type protein or to a toxin gene encoding a 
ViP-type toxin protein, preferably to a CryiA(b) protein. 

Factors that effect the stability of hybrids determine the stringency of the 
hybridization. One such factor is the melting temperature Tm which can be easily calculated 
according to the formula provided in DNA PROBES. George H. Keller and Mark M. Manak , 
Macmillan Publishers Ltd. 1993, Section one: Molecular Hybridization Technology; page 8 
ff. 

The preferred hybridization temperature is in the range of about ZS^'C below the 
calculated melting temperature Tm and preferably in the range of about 12-15*C below 
the calculated melting temperature Tm and in the case of oligonucleotides in the range of 
about 5-10*'C below the melting temperature Tm. 

The invention further relates to a commercial bag comprising seed of a transgenic 
plant comprising at least a toxin gene encoding a toxin protein of Bacillus thuringiensis, 
preferably a Cry-type toxin protein, more preferably a Cryl-type toxin protein, but most 
preferably a CrylA-type toxin protein and expressing the said toxin protein in an amount 
sufficient to provide control against Ostrinia furnacalis {Asian Com Borer) species, together 
with lable instructions for the use thereof for control of Ostrinia furnacalis (Asian Corn Borer) 
pests in crop plants. Preferred within this invention is a commercial bag comprising seed of 
a transgenic plant comprising as an active ingredient a gene encoding at least a Cry-type 
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toxin protein and a VIP-type protein. Especially preferred is a combination of a CrylA(b) 
toxin protein with a VIP-type protein. 

The further object of the invention is a connmercial bag comprising an insecticidal 
composition according to the invention together with lable instructions for the use thereof for 
control of Ostrinia fumacalis {Asian Com Borer) pests in crop plants. 

By plant is meant any plant species which can be genetically transformed by methods 
known in the art, but especially those plants that are host plants for Ostrinfa furnacalis 
(Asian Corn Borer) species including, but not limited to, the following species of plants: 
maize, wheat, barley, rye, oats, rice, sorghum, millet and related crops, forage grasses, 
bamboo (orchardgrass, fescue, and the like), and sugar cane. 

Methods known in the art for plant transformation are discussed below. Host plants 
include, but are not limited to, those species previously listed as target crops. 

The invention further relates to seed of a transgenic plant comprising a gene 
encoding a toxin protein of Bacillus thuringiensis and expressing said toxin protein in an 
amount sufficient to provide control against Ostrinia furnacalis {Asian Corn Borer) 
species, and a commercial t>ag containing said seed. 

By plant is meant any plant species that is a host for Ostrinia fumacalis {Asian Com 
Borer) including, but not limited to, the species of maize, wheat, barley, rye, oats, rice, 
sorghum, millet and related crops, forage grasses, bamboo and sugar cane. 

It has been discovered that the codon usage of a native Bacillus thuringiensis 
toxin gene is significantly different from that which is typical of a plant gene. In particular, 
the codon usage of a native Bacillus thuringiensis gene is very different from that of a 
maize gene. As a result, the mRNA from this gene may not be efficiently utilized. Codon 
usage might influence the expression of genes at the level of translation or transcription 
or mRNA processing. To optimize a toxin gene for expression in plants, for example in 
maize, the codon usage is optimized by using the codons which are most preferred in 
maize (maize preferred codons) in the synthesis of a synthetic gene which encodes the 
same protein as found for the native toxin gene sequence. The optimized maize 
preferred codon usage is effective for expression of high levels of the Bt insecticidal 
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protein. Furlher details for constructing maize-optimized synthetic toxin genes can be 
found in WO 93/07278. herein incorporated by reference in its entirety. 
Toxin genes derived from microorganisms may also differ from plant genes. Plant genes 
differ from genes found in microorganisms in that their transcribed RNA does not 
possess defined ribosome binding site sequence adjacent to the initiating methionine. 
Consequently, microbial genes can be enhanced by the inclusion of a eukaryotic 
consensus translation initiator at the ATG. Clontech (1993/1994 catalog, page 210) has 
suggested the sequence GTCGACCAIGGTC as a consensus translation initiator-forthe 
expression of the £. coli uidA gene in plants. Further, Joshi (NucI Acids Res 15: 6643- 
6653 (1987)) has compared many plant sequences adjacent to the ATG and suggests 
the consensus TAAACA^IGGCT. In situations where difficulties are encountered in the 
expression of microbial ORFs in plants, inclusion of one of these sequences at the 
initiating ATG may improve translation. In such cases the last three nucleotides of the 
consensus may not be appropriate for inclusion in the modified sequence due to their 
modification of the second amino acid residue. Preferred sequences adjacent to the 
initiating methionine may differ between different plant species. By surveying the 
sequence of maize genes present in the GenBank/EMBL database it can be discerned 
which nucleotides adjacent to the ATG should be modified to enhance translation of the 
toxin gene introduced into maize. 

In addition, it has been shown that removal of illegitimate splice sites can 
enhance expression and stability of introduced genes. Genes cloned from non-plant 
sources and not optimized for expression in plants may contain motifs which can be 
recognized in plants as 5' or 3' splice sites. Consequently, the transcription process can 
be prematurely terminated, generating truncated or deleted mRNA. The toxin genes can 
be engineered to remove these illegitimate splice sites using techniques well known in 
the art. 

Many 5-endotoxin proteins from Bacillus thuringiensis are expressed as protoxins. 
These protoxins are solubilized in the alkaline environment of the insect gut and are then 
proteolytically converted by proteases into a toxic core fragment (Hofte and Whiteley. 
Microbiol. Rev. 53: 242-255 (1989)). For 5-endotoxin proteins of the Cryl class, the toxic 
core fragment is localized in the N-terminal half of the protoxin. It is within the scope of 
the present invention that genes encoding either the full-length protoxin form or the 
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truncated toxic core fragment of the novel toxin protein can be used in plant 
transformation vectors to confer insecticidal properties upon the host plant. 
The recombinant DNA molecules can be introduced into the plant cell in a number of 
art-recognized ways. Those skilled In the art will appreciate that the choice of method 
might depend on the type of plant, I.e. monocot or dicot. targeted for transformation. 
Suitable methods of transforming plant cells Include microinjection (Crossway et al., 
BioTechniques 4:320-334 (1986)), electroporation (Riggs et al, Proc. Natl. Acad. Sci. 
USA 83:5602-5606 (1986), ^d^bacfenum-mediated transforniation (Hlnchee et al., 
Biotechnology 6:915-921 (1988)), direct gene transfer (Paszkowski et aL, EMBO J, 
3:2717-2722 (1984)), and ballistic particle acceleration using devices available from 
Agracetus, Inc.. Madison, Wisconsin and Dupont, Inc., Wilmington, Delaware (see, for 
example. Sanford et a!., U.S. Patent 4,945,050; and McCabe et aL, Biotechnology 
6:923-926 (1988)). See also, Weissinger et aL, Annual Rev. Genet. 22:421-477 (1988); 
Sanford et aL, Particulate Science and Technology 5:27-37 91987)(onion); Christou et 
aL, Plant Physiol. 87:671-674 (1988)(soybean); McCabe et aL, Bio/Technology 6:923-926 
(1988)(soybean); Datta etaL, Bio/Technology 8:736-740 (1990)(rice); Klein et aL, Proc. 
Natl. Acad. Sci. USA, 85:4305-4309 (1988)(maize); Klein et aL, Bio/Technology 
6:559-563 (1988)(maize); Klein et aL. Plant Physiol. 91:440-444 (1988)(maize): Fromm et 
al., Bio/Technology 8:833-839 (1990); and Gordon-Kamm et aL. Plant Cell 2:603-618 
(1990)(maize); Svab et al. Proc. Natl. Acad. Sci. USA 8Z: 8526-8530 (1990) (tobacco 
chloropiast); Koziel etal. (Biotechnology H: 194-200 (1993)) (maize); Shimamoto etaL 
Nature 338: 274-277 (1989) (rice); Christou etal. Biotechnology 9: 957-962 (1991) (rice); 
European Patent Application EP 0 332 581 (orchardgrass and other Pooideae)] Vasil et 
al. (Biotechnology 11: 1553-1558 (1993) (wheat); Weeks etal. (Plant PhysioL 102: 1077- 
1084 (1993) (wheal); Wan et al (Plant Physiol 104: 37-48 (1994) (bariey); Umbeck etal, 
(Bio/Technology 5: 263-266 (1987) (cotton). 

One particularty preferred set of embodiments for the introduction of recombinant DNA 
molecules into maize by microprojectile bombardment can be found in WO 93/07278, 
herein incorporated by reference in its entirety. An additional preferred embodiment is 
the protoplast transformation method for maize as disclosed in Application 
EP-A-292 435, hereby incorporated by reference in its entirety. 

The genetic properties engineered into the transgenic seeds and plants described 
above are passed on by sexual reproduction or vegetative growth and can thus be 
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maintained and propagated in progeny plants. Generally said maintenance and propagation 
make use of known agricultural methods developed to fit specific purposes such as tilling, 
sowing or harvesting. Specialized processes such as hydroponics or greenhouse 
technologies can also be applied. As the growing crop is vulnerable to attack and damages 
caused by insects or infections as well as to competition by weed plants, measures are 
undertaken to control weeds, plant diseases, insects, nematodes, and other adverse 
conditions to improve yield. These include mechanical measures such a tillage oLthe soil or 
removal of weeds and infected plants, as well as the application of"Sgrochemicals such as 
herbicides, fungicides, gametocides, nematicides, growth regulants. ripening agents and 
insecticides. 

Use of the advantageous genetic properties of the transgenic plants and seeds 
according to the invention can further be made in plant breeding which aims at the 
development of plants with improved properties such as tolerance of pests, herbicides, or 
stress, improved nutritional value, increased yield, or improved structure causing less loss 
from lodging or shattering. The various breeding steps are characterized by well-defined 
human intervention such as selecting the lines to be crossed, directing pollination of the 
parental lines, or selecting appropriate progeny plants. Depending on the desired properties 
different breeding measures are taken. The relevant techniques are well known in the art 
and include but are not limited to hybridization, inbreeding, backcross breeding, multiline 
breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. Hybridization 
techniques also include the sterilization of plants to yield male or female sterile plants by 
mechanical, chemical or biochemical means. Cross pollination of a male sterile plant with 
pollen of a different line assures that the genome of the male sterile but female fertile plant 
will uniformly obtain properties of both parental lines. Thus, the transgenic seeds and plants 
according to the invention can be used for the breeding of improved plant lines which for 
example increase the effectiveness of conventional methods such as herbicide or pestidice 
treatment or allow to dispense with said methods due to their modified genetic properties. 
Alternatively new crops with improved stress tolerance can be obtained which, due to their 
optimized genetic "equipment", yield harvested product of better quality than products which 
were not able to tolerate comparable adverse developmental conditions. 

In seeds production germination quality and uniformity of seeds are essential product 
characteristics, whereas germination quality and uniformity of seeds harvested and sold by 



wo 97/46105 



PCT/EP97/02737 



-17- 

the farmer is not important. As it is difficult to l<eep a crop free from other crop and weed 
seeds, to control seedborne diseases, and to produce seed with good germination, fairly 
extensive and well-defined seed production practices have been developed by seed 
producers, who are experienced in the art of growing, conditioning and marketing of pure 
seed. Thus, it is common practice for the farmer to buy certified seed meeting specific 
quality standards instead of using seed harvested from his own crop. Propagation material 
to be used as seeds is customarily treated with a protectant coating comprising herbicides, 
jnsecticides, fungicides* bactericides, nematicides, molluscicides or mixtures thereof. 
Customarily used protectant coatings comprise compounds such as captan, carboxin, 
thiram (TMTD*), methalaxyl (Apron*), and pirimiphos-methyl (Actellic*). If desired these 
compounds are formulated together with further carriers, surfactants or application- 
promoting adjuvants customarily employed in the art of formulation to provide protection 
against damage caused by bacterial, fungal or animal pests. The protectant coatings may 
be applied by impregnating propagation material with a liquid formulation or by coating with 
a combined wet or dry formulation. Other methods of application are also possible such as 
treatment directed at the buds or the fruit. 

It is a further aspect of the present invention to provide new agricultural methods such 
as the methods exemplified above which are characterized by the use of transgenic plants, 
transgenic plant material, or transgenic seed according to the present invention to provide 
control against Ostrinia fumacalis {Asian Com Borer). 

To breed progeny from plants transformed according to the method of the present 
invention, a method such as that which follows may be used: maize plants produced as 
described in the examples set forth below are grown in pots Jnja greenhouse or in soil, as is 
known in the art, and permitted to flower. Pollen is obtained from the mature tassel and 
used to pollinate the ears of the same plant, sibling plants, or any desirable maize plant. 
Similarly, the ear developing on the transformed plant may be pollinated by pollen obtained 
from the same plant, sibling plants, or any desirable maize plant. Transformed progeny 
obtained by this method may be distinguished from non-transformed progeny by the 
presence of the introduced gene(s) and/or accompanying DNA (genotype), or the 
phenotype conferred. The transformed progeny may similarly be selfed or crossed to other 
plants, as is normally done with any plant carrying a desirable trait. Similarly, tobacco or 
other transformed plants produced by this method may be selfed or crossed as is known in 
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the art in order to produce progeny with desired characteristics. Similarly, other transgenic 
organisms produced by a combination of the methods known in the art and this invention 
may be bred as is l<nown in the art in order to produce progeny with desired characteristics. 
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EXAMPLES 

The following examples further describe materials and methods used to obtain specific 
embodiments of the present invention. They are offered by way of illustration, and should 
not be interpreted as limitating the disclosure of the specification. 

EXAMPLE 1: General MethodT 

DNA manipulations were done using procedures that are routinely practized in the art. 
These procedures can often be modified and/or substituted without substantively 
changing the result. Except where other references are identified, the procedures are 
described in general text books such as Sambrook et ai., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, second edition, 1989. 

EXAMPLE 2: Plant Transformation Vectors 

Plant transformation is accomplished using the transformation vectors pCIB 4431 and 
pCIB 3064 described in WO 93/07278 and Koziel et al (1993) [Biotechnology Vol 1 1. 
194-200], both disclosures being incorporated herein by reference. 
pCIB4431 is a vector designed to transform maize. It contains two chimeric synthetic Bt 
crylA(b) endotoxin genes expressible in maize the one of them constituting a PEP 
carboxylase promoter/synthetic-cfyMfb; gene, the other one a pollen promoter/synthetic- 
crylA(b) gene. 

pCIB4431 contains the synthetic crylA(b) gene provided in SEQ ID NO: 1 and was 
deposited on September 21. 1992 with the Agricultural Research Sen^ice, Patent Culture 
Collection (NRRL). Northern Regional Research Center, 1815 North University Street, 
Peoria, Illinois 61604, U.S.A. under accession no NRRL B-18998. 
pCIB3064 contains a plant expressible bar gene (615 bp), which was originally cloned 
from Sfrepfomyces /lygiroscop/ct/s [Thompson etal. (1987) EMBO J 6, 2519-2523]. It 
encodes a phosphinotricin acetyltransferase (PAT), conferring tolerance to phosphino- 
tricin. The bar gene is under the control of the CaMV 35S promoter and terminator (OW 
et al (1987) Proc Natl Acad Sci USA 84, 4870-4874J to provide resistance to phosphino- 
tricin. 
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EXAMPLE 3: Production of transgenic maize plants containing the synthetic maize 
CrylA{b) gene 

The example below utilizes a biolistic device to introduce DNA coated particles into maize 
cells, from which transformed plants are generated. 

3.1 Tissue 

Immature maize embryos, approximately 1.5-2.5 mm in length, were excised from an ear 
of genotype 6N615 14-15 days after pollination. The mother plant was grown in the 
greenhouse. Before excision, the ear was surface sterilized with 20% Clorox for 20 
minutes and rinsed 3 times with sterile water. Individual embryos were plated scutellum 
side up in a 2 cm square area, 36 embryos to a plate, on the callus initiation medium, 
2DG4 + 5 chloramben medium (N6 major salts, B5 minor salts, MS iron. 2% sucrose, with 
5 mg/l chloramben, 20 mg/l glucose, and 10 ml G4 additions (Table 1} added after 
autoclaving. 

Table 1 : G4 Additions 

Ingredient per liter medium 



Casein hydrolysate 


0.5 g 


Proline 


1.38 g 


Nicotinic acid 


0.2 mg 


Pyridoxine-HCI 


0.2 mg 


Thiamine-HCI 


0,5 mg 


Choline-HCI 


0.1 mg 


Riboflavin 


0.05 mg 


Biotin 


0.1 mg 


Folic acid 


0.05 mg 


Ca pantothenate 


0,1 mg 


p-aminobenzoic acid 


0.05 mg 


B12 


0.136 pg 
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3.2 Preparation of DNA for delivery 

The microcarrier was prepared essentially according to the instructions supplied with the 
Biolistic device. While vortexing 50 pi 1.0 gold microcarrier, 5 pi of pCIB4431 (1.23 
pg/pl) (#898] + 2 pi pCIB3064 (0.895 pg/pl) [#456] was added followed by 50 pi 2.5 M 
CaCl2, then 20 pt 0.1 M spermidine (free base, TC grade). The resulting mixture was 
vortexed 3 minutes and microfuged tor 10 sec. The supernatant was removed and the 
microcarriers washed 2 times with 250 pi of 100% EtOH (HPLC grade) by vortexing 
briefly, centrifuging^nd removing the supernatant. The microcarriers are resuspended in 
65 pi 100% EtOH. 



3.3 Bombardment 

Tissue was bombarded using the PDS-IOOOHe Bioiistics device. The tissue was placed 
on the shelf 8 cm below the stopping screen shelf. The tissue was shot one time with the 
DNA/gold microcarrier solution, 10 pi dried onto the macrocarrier. The stopping screen 
used was hand punched using 10x10 stainless steel mesh screen. Rupture discs of 
1550 psi value were used. After bombardment, the embryos were cultured in the dark at 
25- C. 

3.4 Callus formation 

Embryos were transferred to callus initiation medium with 3 mg/l PPT 1 day after 
bombardment. Embryos were scored for callus initiation at 2 and 3 weeks after 
bombardment. Any responses were transferred to callus maintenance medium, 2DG4 + 
0.5 2.4-D medium with 3 mg/L PPT. Callus maintenance medium is N6 major salts, B5 
minor salts, MS iron, 2% sucrose, with 0.5 mg/l 2.4-D, 20 mg/i glucose, and 10 ml G4 
additions added after autoclaving. Embryogenic callus was subcultured every 2 weeks to 
fresh maintenance medium containing 3 mg/L PPT. All callus was incubated in the dark 
at 25°C. 

The Type I callus formation response was 15%. Every embryo which produced callus 
was cultured as an individual event giving rise to an individual line. 



wo 97/46105 



PCT/EP97/02737 



-22- 

3.5 Regeneration 

After 12 weeks on selection, the tissue was removed from callus maintenance medium 
with PPT and was placed on regeneration medium. Regeneration medium is 
.0.25MS3S5BA (0.25 mg/l 2,4 D, 5 mg/l BAP, tAS salts. 3% sucrose) for 2 weeks followed 
by subculture to MS3S medium for regeneration of plants. After 4 to 10 weeks, plants 
were removed and put into GA 7's. 

EXAMPLE 4: Analysis of transgenic maize plants 

4.1 ELISA Assay 

Detection of crylA(b) gene expression in transgenic maize is monitored using Asian corn 
borer insect bioassays and ELISA analysis for a quantitative determination of the level of 
crylA(b) protein obtained. 

Quantitative determination of crylA(b) insecticidal protein in the leaves of transgenic plants 
is performed using enzyme-linked immunosorbant assays (ELISA) as disclosed in Clark M 
F, Lister R M, Bar-Joseph M: ELISA Techniques. In: Weissbach A, Weissbach H (eds) 
Methods in Enzvmoloav 118:742-766. Academic Press. Florida (1986), Immunoaffinity 
purified polyclonal rabbit and goat antibodies specific for the B. thurinoiensls subsp. kurstaki 
insecticidal protein are used to determine the amount of insecticidal protein per mg soluble 
protein from cmde extracts of leaf samples. The sensitivity of the double sandwich ELISA is 
1-5 ng insecticidal protein per mg soluble protein using 50 \xg of total protein per ELISA 
microtiter dish well. 

Corn extracts are made by grinding leaf tissue in gauze lined plastic bags using a hand held 
ball-bearing homogenizer (AGDIA, Elkart IN.) in the presence of extraction buffer (50 mM 
Na2C03 pH 9.5, 100 ml^ NaCi, 0.05% Triton. 0.05% Tween. 1 mM PMSF and 1 pM 
leupeptin). Protein determination is performed using the Bio-Rad (Richmond, CA) protein 
assay. 

4.2 Asian Com Borer Assav 

One to four 4 cm sections are cut from an extended leaf of a corn plant. Each leaf piece is 
placed on a moistened filter disc in a 50 x 9 mm petri dish. Five neonate Asian corn borer 
larvae are placed on each leaf piece (making a total of 5-20 larvae per plant). The petri 
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dishes are incubated at 29.5 ^'C. Leaf feeding damage and mortality data are scored after 
24, 48, and 72 hours. 

EXAMPLE 5: Ostrinia fumacalis (Asian Corn Borer) Field Testing Assay 

Small peat pots containing transgenic seedlings which were first tested for the presence 

and the expression of the transgene, are transplanted into the field. Non-transgenic inbred 

lines are planted in the same field over a six week period, to sen^e as controls and for — ^ 

pollinations. 

When plants in the field reach 40 cm of extended leaf height, infestation with laboratory- 
reared ostrinia furnacaiis (Asian Com Borer) larvae begins on both the transgenic and non- 
transgenic control plants. About 300 neonate larvae mixed with corn cob grits are 
introduced into the whorl of each plant using a Davis inoculator. Infestations continues on a 
weekly basis for four weeks to stimulate first generation Asian Com Borer. Starting two 
weeks after the initial infestation, each plant is rated weekly for four weeks using a 1 to 9 
scale (1= no visible leaf injury; 9s=most leaf with long lesions, several leaves with broken mid 
ribe, possibly stunted plants due to Asian Com Borer feeding). A mean Asian Com Borer 
damage rating score is calculated for each transgenic and non-transgenic control plant. As 
each plant reaches anthesis, 300 larvae/plant are applied weekly for four weeks to stimulate 
second generation infestation. One hundred of neonate larvae in com cob grits are 
introduced into the leaf axil at the primary ear and at the leaf axil one node above and 
below the primary ear node. Therefore a total of approximativety 2400 lan^ae are applied to 
each plant. About 50 days after the initial second generation infestation, stalks from all 
transplanted and some non-transgenic plants are harvested. The extent of intemal second 
generation infestation tunneling damage in the whole plants is detemnined. 

EXAMPLE S: Assay of extract from transformed protoplasts for insecticidai activity 
against Ostrinia fumacalis (Asian Corn Borer) 

Western blot analysis is performed using extracts obtained from maize cells which had 

been transiently transformed with DNA to express the maize optimized gene. 

Qualitative insect toxicity testing is carried out using harvested protoplasts. Suspensions 

are prepared for each replicate tested in the bioassays. A replicate is considered positive if 

it causes significantly higher mortality than the controls. For example, replicates are tested 

for their activity against insects in the order Lepj do ptera by using the Asian corn borer, 



wo 97/46105 



PCT/EP97/02737 



-24 - 

Ostrinia furnacalis . One-hundred |jl of a protoplast suspension in 0.1% Triton X-100 is 
pipetted onto the surface of artificial Black cutwornY diet, (Bioserv, Inc.. Frenchtown. NJ; 
F9240) in 50 mm X 10 mm snap-cap petri dishes. After air drying 10 neonatal larvae are 
added to each plate. Mortality is recorded after about 4 days. 

EXAMPLE 7: Ostrinia furnacalis (Asian Com Borer) Plant Dipping Assay 

7.1 Bacillus thurinaiensis (Bt) crystals 

Bacillus thuringiensis (Bt) crystals are prepared for stock suspension with 22 ml of 
distilled water. The suspension is kept in the refrigerator. 

7.2 Parameters recorded 

3 days old larvae of Ostrinia furnacalis [Asian Corn Borer) are allowed to feed on maize 
leaves. Larvae had previously been fed with untreated leaves. 120 hours later the 
number of larvae dead is recorded. The kind of feeding injuries on leaf plants is observed 
in each case. 

7.3 Method for testing 

Plants of two homozygous inbred lines of Zea mays susceptible to Asian Corn Borer are 
used (Lines A and B). Seedling plants aged 9-10 days are dipped in various 
concentrations of Bt protein suspension and are used in feeding experiments, wherein 
larvae are released on dried leaves of seedling plants, 5-10 larvae per plant. The 
seedling plants are covered with nylon mesh bags and kept in a nylon mesh case. 4-5 
concentrations with 4jreplications are tested and mortality is determined. The 
temperature is kept at 21-30**C, 
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7.4 Results 

Table 2: Results for Line A 



Dose [ppm] 


Number of 
Insects 


Number of 
dead insects 


Mortality [%] 


80 


40 


31 


77.5 


40 


40 


29 


72.5 


20 


40 


15 


37.5 


To 


40 


13 


32.5 _ 


5 


40 


5 


12.5 


0 


40 


0 


0.00 


Table 3: Results for Line B 






Dose [ppm] 


Number of 
insects 


Number of 

dead insects 


Mortality [%] 


80 


40 


38 


94.74 


40 


40 


33 


81.58 


20 


40 


28 


68.42 


10 


40 


19 


44.74 


5 


40 


10 


21.05 


0 


40 


2 


5.00 



Two kinds of injuries were clearly distinguished in maize leaves: Bt dipped leaves of the 
seedling plants were lightly damaged, whereas the leaves of control seedlings were 
severely damaged. 

The following LCso-Values were obtained: 

Line A: LC50 = 23.412 ppm (range from 1 7,834 to 30.734) 

Line B: LC50 = 12.234 ppm (range from 9.547 to 15.676) 
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EXAMPLE 8: Ostrinia furnacalis {Asian Corn Borer) Plant Dipping Assay (VIP3A) 

8.1 VIP3A protein 

5mg of VIP3A protein were prepared with 50ml of distilled water in order to prepare 
varying concentration of VIP3 protein: lOOppm, 50ppm, 25ppm, 12.5ppm. 6.25ppm and 
Oppm (check). 

8.2 OstriniaJumacalis I Asian Corn Borer ) 

The pupae collected from farmers field at Racha Buri by Entomology and Animal Science 
Division DOA is order to prepare Larvae (L2) for testing. 

8.3 Parameters recorded 

Data were collected after incubation 5 days by counting number of died larvae and then 
analyzed percentage of mortality of larvae by probit Analysis Program. 

7.3 Method for testing 

Plants of two homozygous inbred lines of Zea mays susceptible to Asian Corn Borer are 
used (Lines B and C). Line B was conducted for potted plant test (4 replications 5 rated 
concentatlon and check) and Line C was conducted for leaf dipping test (4 replications 
with 100, 50, 25ppm and check). Seedling plants aged 10-14 days are dipped in various 
concentrations of VIP3A protein suspension and are used in feeding experiments, 
wherein larvae are released on dried leaves of seedling plants. 5-10 larvae per plant. The 
seedling plants are covered with nylon mesh bags and kept in a nylon mesh case. Cut 
leaves were put in platic blocks and kept in control room four replications were applied for 
this experiment. 
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7.4 Results 

Table 4 : Results for Line B 
Table 4a 



Dose [ppm] 


Number of 
insects 


Number of 
dead insects 


Mortality [%] 


100 


40 


31 


76.92 


50 


40 


26 


64.10 


"25 


- -40" 


16 _ 


38.46 


12.5 


40 


14 


33.33 


6.25 


40 


9 


20.51 


0 


40 


1 


2.50 



Table 4b 



Replication 


number of dead larvae after 5 days 


concentration (ppm) 




Check 


6.25 


12.5 


25 


50 


100 


1(10) 


0 


3 


3 


5 


8 


6 


II (10) 


0 


2 


2 


3 


6 


9 


111(10) 


1 


3 


5 


4 


5 


9 


IV (10) 


0 


1 


4 


4 


7 


7 


Total 


1 


9 


14 


16 


26 


31 



Table 5: Results for Line C 
Table Sa 



Replication 


number of dead larvae after 24 hours 


concentration (ppm) 




Check 


25 


50 


100 


1(10) 


0 


0 


0 


1 


II (10) 


0 


0 


0 


1 


111(10) 


0 


0 


. 0 


0 


IV (10) 


0 


0 


0 


1 


Total 


0 


0 


0 


3 



wo 97/46105 



PCT/EP97/02737 



-28- 



Table 5b 



Replication 


number of dead lan^ae after 48 hours 


concentration (ppm) 




Check 


25 


50 


ICQ 


1(10) 


0 


0 


0 


1 


II (10) 


0 — 


0 


0 


1 


III (loT 


0 


0 


1 


0 


IV (10) 


0 


0 


0 


1 


Total 


0 


0 


1 


3 



Table Sc 



Replication 


number of dead larvae after 72 hours 


concentration (ppm) 




Check 


25 


50 


100 


1(10) 


0 


1 


0 


2 


II (10) 


0 


1 


0 


1 


III (10) 


0 


0 


2 


3 


IV (10) 


0 


0 


2 


1 


Total 


0 


2 


4 


7 



Table 5d 



Replication 


number of dead \arvae after 96 hours 


concentration (ppm) — 




Check 


25 


50 


100 


1(10) 


0 


1 


1 


4 


11(10) 


0 


2 


1 


3 


III (10) 


0 


1 


3 


5 


IV (10) 


0 


1 


4 


3 


Total 


0 


5 


1 


15 
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Table 5e 



Replication 


number of dead lan/ae after 120 hours 


concentration (ppm) 




Check 


25 


50 


100 


1(10) 


1 


3 


2 


7 


II (10) 


0 


2 


1 


4 


III (10) 


0 


3 


5 


5 


IV (10) 


0 


2 


4 


9 


Total 


1 


10 


12 


25 



The following LCso-Values were obtained after 120 hours 
Line B: LC50 = 29.558 ppm (range from 21 .298 to 41 .022) 
Line C: LC50 = 78.498 ppm (range from 53.644 to 1 1 4.866) 
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SBQUEWCE LISTING 



(1) GENERAL INP0RMftTIC3N: 



(i) APPLICAOT; 

(A) NAME: Novartis AG 

(B) STREET: Schwarzwaldallee 215 

(C) CTTf iJBasel 

(E) CXXINIKY: Switzerland 

(F) POOTAL OOOE (ZIP) : 4002 

(G) THiEFHQNE: -i-41 61 69 11 11 

(H) TELEFAX: -i- 41 61 696 79 76 

(I) TELEX: 962 991 



(ii) TITI£ OF INVENTICN: Method of Controlling Insect Pests 
(iii) NUMBER OF SEQUENCES: 55 

(iv) CXMPOTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) CX3MPUTER: IBM PC cotrpatible 

(C) OPERATING SYSTEM: PC-DOS /MS -EXDS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30B 



(2) INFORMATION FDR SBQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTIH: 6049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE3HESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ENA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillxis cereus 

(B) STRAIN: AB78 

(C) INDIVIDUAL ISOLATE: NRRL B-21058 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATIOJ: 1082., 2467 

(D) OTHER INFORMATION: /product= "VIP2A{a)" 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
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(B) LOCATION: 2475. .5126 

(D) OTHER INFORMATION: /note= "Coding sequence for the 100 
kd VIPlA{a) protein. This coding sequence is repeated in SEQ ID 
NO: 4 and translated s^jarately." 

(xi) SEQUENCE DESCMPTION: SEQ ID N0:1: 

ATOGATACAA TGTTOmTA CTTAGACCX3G TAGTCTCTCT AATrTCTTTA ATGCTATATT 60 

CTTTACTITC ATACATITTA ATAGCCATTT CAACCTTATC AGrTATGrmT TCTOGItTIC 120 

CrCCTTmr TCCAOSAGCT CTACSCTOCXST TTAATCXTOT TrroGTAOST TOSCTAAaAA 180 

TATCTCTT3C. TAATICTQCA ATACTTOOCA TCATTCSAA GAAGAATTTC OCCATAQCAT 240 

TAGAGCTATC AAICTICTCA TGAATAGAAA TAAAATCTAC ACCTAQCTCT TTCAATmT 300 

CACrrAACTC AATTAGGrPGT TTTOIAGAGC GAGAAATTCXS ATCAAGriTTC TAAACAACTA 360 

TCTTATCGCC TrTACCTAAT ACnTTAGCA ACTCTTCGAG TTGAGGGCGC TCTTTmTA 420 

TTCCrGTEAT TITCTOCTQA TATAGCCTTT CTACAOCATA TICTTQCAAA QCATCTATTT 480 

GCATATOGAG ATTTTGTTCT TCTGTOCTGA CACGAGCATA ACCAAAAATC AAATTOGTTT 540 

CACnCCTAT CTAAATATAT CTATTAAAAT AGCAOCAAAA AOCTEATTAA ATTAAAATAA 600 

QGAACrnGT TTnOGATAT OGATTITQGT ACTCAATATG GATGAGTITT TAACGCTTTT 660 

GTTAAAAAAC AAACAAGTGC CATAAACGGT CGTmTQGG ATGACATAAT AAATAATCTC 720 

TTTGATTAAC CTAACCITGT ATCXHTACAG CCCAGTITrA TITCTACrrC AACIGACTGA 780 

ATAIGAAAAC AACATGAAQG TTTCATAAAA TXTATATATT TTCCATAAOG GATQCTCTAT 840 

CTrTAQGTTA TAG?ITAAATT ATAAGAAAAA AACAAAOQGA QQGAGnGAAA AAAAGCATCT 900 

TCTCTATAAT TTTACAQGCT CTTTAATAAG AAGQGGQGAG ATTAGATAAT AAATATCAAT 960 

ATCTATCTAT AATTGnTGC TICTACAATA ACTTATCTAA CTTICATATA CAACAACAAA 1020 

ACAGACTAAA TCCAGATTGrT ATATTCATIT TCMTITCrrc CTTTATAAAA TAATITCATA 1080 

A ATG AAA AGA ATG GAG GGA AAG TTG TIT ATG GTG TCA AAA AAA TTA 1126 
Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu 
1-5 10 15 

CAA GTA GOT ACT AAA ACT GTA TIG COT AGT ACA GOT TIC TCT ATA TCT 1174 
Gin Val Val Ihr Lys Thr Val Lai Leu Ser Thr Val Phe Ser lie Ser 
20 25 30 

OTA OTA AAT AAT GAA GTG ATA AAA GCT GAA CAA OTA AAT ATA AAT TCT 1222 
Leu Leu Asn Asn Glu Val lie Lys Ala Glu Gin Leu Asn He Asn Ser 
35 40 45 

CAA AGT AAA TAT ACT AAC TTG CAA AAT CTA AAA ATC ACT GAG AAG GTA 1270 
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Gin Ser Lys lyr Itir Asn Leu Gin Asn Leu Lys lie Thr Asp Lys Vai 
50 55 60 

GAG GAT TIT AAA GAA GAT AAG GAA AAA GCX5 AAA GAA TOG GGG AAA GAA 1318 
Glu Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu 
65 70 75 

AAA GAA AAA GAG TGG AAA CTA ACT GCT ACT GAA AAA GGA AAA ATG AAT 1366 
Lys Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn 
80 85 90 95 

AAT TIT TTA-GAT AAT AAA AAT GAT ATA AAG ACA AAT TAT AAA GAA ATT 1414 

— ^n Phe Leu Asp Asn Lys Asn Asp He Lys 'Thr Asn Tyr Lys-Glu He 
100 105 110 

ACT TTT TCT ATG GCA GGC TCA TTT GAA GAT GAA ATA AAA GAT TTA AAA 1462 
Thr Phe Ser Met Ala Gly Ser Phe Glu Asp Glu He Lys Asp Leu Lys 
115 120 125 

GAA ATT GAT AAG ATG TIT GAT AAA AOC AAT CTA TCA AAT TCT ATT ATC 1510 
Glu He Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser He He 
130 135 140 

ACC TAT AAA AAT GTG GAA COG ACA ACA ATT GGA TTT AAT AAA TCT TTA 1558 
Thr 'Tyr Lys Asn Val Glu Pro Thr Thr He Gly Phe Asn Lys Ser Leu 
145 150 155 

ACA GAA GGT AAT ACG ATT AAT TCT GAT GCA ATG GCA CAG TTT AAA GAA 1606 
Thr Glu Gly Asn Thr He Asn Ser Asp Ala Met Ala Gin Phe Lys Glu 
160 165 170 175 

CAA TTT TTA GAT AGG GAT ATT AAG TTT GAT AGT TAT CPA GAT ACG CAT 1654 
Gin Ehe Leu Asp Arg Asp He Lys Phe Asp Ser Tyr Leu Asp Thr His 
180 185 190 

TTA ACT GCT CAA CAA GTT TCC AGT AAA GAA AGA GTT ATT TTG AAG GTT 1702 
Leu Thr Ala Gin Gin Val Ser Ser Lys Glu Arg Val He Leu Lys Val 
195 200 205 

ACG GTT COG AGT GGG AAA GGT TCT ACT ACT CCA ACA AAA OCA GGT GTC 1750 
Thr Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val 
210 215 220 

ATT TTA AAT AAT AGT GAA TAC AAA ATG CIC ATT GAT AAT GGG TAT ATG 1798 
He Leu Asn Asn Ser Glu TVr Lys Met Leu He Aj^ Asn Gly Tyr Met 
225 230 235 

GTC CAT GTA GAT AAG GTA TCA AAA GTG GTG AAA AAA GGG GTG GAG TQC 1846 
Val His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys 
240 245 250 255 

TTA CAA ATT GAA GGG ACT TTA AAA AAG AGT CTT GAC TTT AAA AAT GAT 1894 
Leu Gin He Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp 
260 265 270 
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ATA AAT GOT GAA GCG CAT AGC TGG GGT ATC AAG AAT TAT GAA GAG TGG 1942 
lie Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp 
21b 280 285 



OCT AAA GAT TTA ACC GAT TOG CAA AGG GAA OCT TTA GAT GGG TAT OCT 1990 
Ala Lys Asp Leu Thr Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala 
290 295 300 

AGG CAA GAT TAT AAA GAA ATC AAT AAT TAT TTA AGA AAT CAA GQC GGA 2038 
Arg Gin Asp Tyr Lys Glu lie Asn Asn Tyr Leu Arg Asn Gin Gly Gly 
305 310 315 

AGT GGA ASFgAA AAA CTA GAT GCT CAA-ATA AAA AAT ATT TCT GAT GCT 2086 
Ser Gly Asn Glu Lys Leu Asp Ala Gin lie Lys Asn lie Ser flsp Ala 
320 325 330 335 

TTA GGG AAG AAA CCA ATA COG GAA AAT ATT ACT GTG TAT AGA TOG TGT 2134 
Leu Gly Lys Lys Pro He Pro Glu Asn He Thr Val Tyr Arg Trp Cys 
340 345 350 



GGC ATG CCG GAA TIT GGT TAT CAA ATT ACT GAT CCG TTA CCT TCT TTA 2182 
Gly Met Pro Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu 
355 360 365 

AAA GAT TIT GAA GAA CAA TTT TTA AAT ACA NTC AAA GAA GAC AAA GGA 2230 
Lys Asp Phe Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly 

370 375 380 

TAT ATG AGT ACA AGC TTA TCG AGT GAA CGT CTT GCA OCT TIT GGA TCT 2278 
Tyr Met Ser 'Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser 
385 390 395 

AGA AAA ATT ATA TTA CGA TTA CAA GTT CCG AAA GGA AGT ADG GGT GOG 2326 
Arg Lys He lie Leu Arg Leu Gin Val Pro Lys Gly Ser Thr Gly Ala 
400 405 410 415 

TAT TTA AGT GCC ATT GGT GGA TIT GCA AGT GAA AAA GAG ATC CTA CTT 2374 
Tyr Leu Ser Ala He Gly Gly E^e Ala Ser Glu Lys Glu He Leu Leu 
420 425 430 

GAT AAA GAT AGT AAA TAT CAT ATT GAT AAA GVh ACA GAG GTA ATT ATT 2422 
Asp Lys Asp Ser Lys^Tyr His He Asp Lys Val Thr Glu Val He He 
435 440 445 

AAA GGT GTT AAG OGA TAT GTA GTG GAT GCA ACA TTA TTA ACA AAT * 2467 
Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn 
450 455 460 

TAAQGAGATG AAAAATATGA AGAAAAAGTT AGCAAGTGTT GTAACGTGTA CGTrATTAGC 2527 



TCCTATGnrr TTGAATQGAA ATGTGAATQC TGTrTACQCA GACAGCAAAA CAAATCAAAT 2587 
TICTACAACA CAGAAAAATC AACAGAAAGA GATOGAOOGA AAAOGATTAC TTQGGTATTA 2647 
TTTCAAAGGA AAAGATTTTA GTAATCTTAC TATGTITGCA OOGACAQGTG ATAGTACTCT 2707 
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TATTTATGAT CAACAAACAG CAAATAAACT ATFAGATAAA AAACAACAAG AATATCAGTC 2767 

TATiaSTTGG ATTGGTTTGA OTCAGAGTAA AGAAACGGGA GATTTCACAT TrAACTTATC 2827 

TGAGGATGAA CAGQCAATTA TAGAAATCAA TOOGAAAATT ATTICTAATA AAGQGAAAGA 2887 

AAAGCAAGTT GKrATTTAG AAAAAGGAAA ATTAGTICCA ATCAAAATAG AGTATCAATC 2947 

AGATACAAAA TTTAATATTG ACACTAAAAC ATTTAAAGAA CTTAAATTAT TTAAAATAGA 3007 

TAGTCAAAACjCAACOCCAQC AAGTCCAGCA AGATGAACTG AGAAATCCTG AATTTAACAA 3067 

GAAAGAA1CA CAGGAATTCT TAQOGAAACC ATCGAAAATA AATCTTITCA CICAAAAAAT 3127 

GAAAAQQQAA ATIGAaXSAAG ACACX3GATAC GGATQQGGAC TCTATTOCTG ACX:nTQQGA 3187 

AGAAAATQQG TATACX3AT1C ACAATAGAAT CGCTGTAAAG TOQGAOGATT CTCTAQCAAG 3247 

TAAAQQGTAT AOGAAATTIG TTTCAAATOC ACTAGAAAGT CKPCP^TTTG QTC A TCCTTA 3307 

TACAGATTAT GAAAAQQCAG CAAGAGATCT AGATTTGTCA AATQCAAAGG AAACGTTTAA 3367 

CCCATTQGTA QCTGCTTTTC CAAGTGTCAA TGTTAGTATG GAAAAGGrTGA TATTAICACC 3427 

AAATGAAAAT TTAICCAATA GTGTAGAGTC TCATTCATCC AOSAAITOGT CTTATACAAA 3487 

TACAGAAQGT QCTTCTGITC AAGOQQQGAT TGGACXAAAA QGTATTTOGT TOGGAGriTAG 3547 

CGTAAACTAT CAACACICTO AAACAGTTOC ACAAGAATQG GGAACATCTA CAGGAAA2!AC 3607 

TTOSCAATIC AATAOQQCTr CAGCX3GGATA TTTAAATOCA AATCTTOGAT ATAACAATGT 3667 

AGGAACTQGT GCCATCTACX3 ATOTAAAACC TACAACAAGT TrTGTATTAA ATAAOGATAC 3727 

TATOGCAACT A3TA0QGCGA AATCTAATTC TACAGCCTTA AATATATCTC CTGGAGAAAG 3787 

TTACCOGAAA AAAQGACAAA ATQGAATOGC AATAACATCA ATQGATGATT TTPATTCCCA 3847 

TCOGATTACA TTAAATAAAA AACAAGTAGA TAATCTOCTA AATAATAAAC CTATGATGTT 3907 

GGAAACAAAC CAAACAGATG GTCTTTATAA GATAAAAGAT ACACATQGAA ATATAGTAAC 3967 

TCGCXSGAGAA TOGAATOGTG TCATACAACA AATCAAQQCT AAAACAGCGT CTATrATTCT 4027 

GGATGATGGG GAACXaTTGrrAG CAGAAAAACG 1CTAGCX3C3CA AAAGATTATC AAAATCCAGA 4087 

AGATAAAACA CXXTPCTITAA CTTTAAAAGA TQCCXTTGAAG CITICATATC CAGATGAAAT 4147 

AAAAGAAATA GAQGGATTAT TATATTATAA AAACAAAOXJ ATATACGAAT OGAOCGTTAT 4207 

GACTTACTTA GATCAAAATA CAGCAAAAGA AGrrGACX:AAA CAATTAAATG ATAOCACTQG 4267 

GAAATTTAAA GATGTAAGTC ATTTATATGA TGTAAAACTG ACTCCAAAAA TGAATGTTAC 4327 
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AATCAAATTG 


TCTATACTTT 


ATGATAATGC 


TGAGTCl^AAT 


GATAACTCAA TIX3GTAAATG 


4387 


GACAAACACA 


AATATTGnT 


CAGGTX3GAAA 


TAACX5GAAAA 


AAACAATATT CTTCTAATAA 


4447 


TCOQGATGCT 


AATITGACAT 


TAAATACAGA 


TGOTCAAGAA 


AAATTAAATA AAAATCGTCA 


4507 


CTATTATATA 


AGTITATATA 


TGAAGTCAGA 


AAAAAACACA 


CAATGTCAGA TTACTATAGA 


4567 


TQQGGAGATT 


TATCOGATCA 


CTACAAAAAC 


AGTGAATCTG 


AATA/^AGACA ATTACAAAAG 


4627 


ATTAGATATT 


ATAGCICATA 


ATATAAAAAG 


TAATQCAATT 


TCTTCACrrC ATATTAAAAC 


4687 


^iaAATGATGAA ATAACTTTAT 


TITQQGATCA 


TATTTCTATA 


ACAGATGTAG CATCAATAAA 


4747 


ACOGGAAAAT 


TTAACAGATT 


CAGAAATTAA 


ACAGATTTAT 


AGTAGGTATC GTATTAAGTT 


4807 


AGAAGATOGA 


ATCCITATro 


ATAAAAAAGG 


TGGGATTCAT 


TATCGTGAAT TTATTAATCA 


4867 


AGCTAGnrr 


AATATIGAAC 


CATTCCAAAA 


TTATOIGACC 


AAATATCAAG TTACTTATAG 


4927 


TAGTGAGTTA 


GGACCAAACX3 


TGAGTGACAC 


ACTIGAAAGT 


GATAAAATZT ACAAGGATGG 


4987 


GACAATTAAA 


TITCATTTTA 


CTAAATATAG 


TAAAAATCAA 


CAAGGATEAT TTTATGACAG 


5047 


TQGATTAAAT 


TQQGACnTA 


AAATTAATQC 


TATTACTrAT 


GATOGTAAAG AGATGAAICT 


5107 


TTTTCATAGA 


TATAATAAAT 


AGrTATOATA 


TCTATGAAGC 


TOGTOCTAAA GATAGTCTAA 


5167 


AAGTTAATAT 


ACTGTAGGAT 


TGTAATAAAA 


GTAATQGAAT 


TCATATCGTA CTITQGAGrPG 


5227 


QQQGATACTT 


TGTAAATAGT 


TCTATCAGAA 


ACATTAGACT 


AASAAAAGTr ACTAOCOXA 


5287 


CTTCAAAATG 


AAGATICAAC 


TGATIACAAA 


CMccrorTA 


AATATTATAA GGTTTTAACA 


5347 


AAATATTAAA 


CTCTTTATGT 


TAATACTGTA 


ATATAAAGAG 


TITAATIGTA TTCAAATGAA 


5407 


GCTTTCOCAC 


AAAATTAGAC 


TCATEATCTA 


ATCAAATAAT 


CAGrrcTAATT TTGTAGAACA 


5467 


GGTCTGGTAT 


TATTGTACGT 


GGTCACTAAA 


AGATATCTAA 


TATEATIGQG CAAGGCGTIC 


5527 


CATGATTGAA 


TCCTCGAATG 


TCTTGCXXTT 


TrCATTTATr 


TAAGAAGGAT OGTGGAGAAA 


5587 


TTATOGTITA 


GATAATGAAG 


AAAGACTTCA 


CTICTAATTT 


TTGATOTTAA ATAAATCAAA 


5647 


AnTGGOGAT 


TCACATTGTCT 


TAATCCACTG 


ATAAAACATA 


CrOGAGTCTT CTTAAAAAAT 


5707 


CAOiTITmT 


CTTTATAAAA 


TrrrGCTTAG 


CGTACGAAAT 


TCGTGTTTTG TTGGTGGGAC 


5767 


CCCATGCOCA 


TCAACTTAAG 


AGTAAATTAG 


TAATCAACTT 


TCX5TTCATCT QGATTAAAAT 


5827 


AACCTCAAAT 


TAQGACATGT 


TTTTAAAAAT 


AAGCAGACCA 


AATAAGCCTA GAATAOGTAT 


5887 


CyVTTITTAAA 


AATTATQCTG 


cri'iL-riTiii 


rmCCAAAT 


CCATTATACT CATAAGCAAC 


5947 


ACCCATAATG 


TCAAAGACTC 


TmrcicTC 


ATATCGATAA 


GCTTGATATC GAATTOCTGC 


6007 
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AGCCCGGGC3G ATCCACTAGT TCTAGAGCGG COGCCACCGC GG 6049 

(2) INPOPMATiaSf FOR SBQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTO: 462 amino acids 

(B) TOTS: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein — 

(xi) SEQUENCE DESCRIPTION: SBQ ID ND:2: 

Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu Gin 
15 10 15 

Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser He Ser Leu 
20 25 30 

Leu Asn Asn Glu Val He Lys Ala Glu Gin Leu Asn He Asn Ser Gin 
35 40 45 

Ser Lys lyr Ohr Asn Gin Asn Lw Lys He Thr Asp Lys Val Glu 
50 55 60 

Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys 
65 70 75 80 

Glu Lys Glu Trp Lys Leu Thr Ala Tftir Glu Lys Gly Lys Met Asn Asn 
85 90 95 

Phe Leu Asp Asn Lys Asn Asp He Lys Ihr Asn Tyr Lys Glu He Thr 
100 105 no 

Phe Ser Met Ala Gly Ser Phe Glu Asp Glu He Lys Asp Leu Lys Glu 
115 120 125 

He Asp Lys Met Phe Asp Lys Ihr Asn Leu Ser Asn Ser He He Thr 
130 135 140 

Tyr Lys Asn Val Glu Pro Ihr Thr He Gly Ptie Asn Lys Ser Leu Thr 
145 150 155 160 

Glu Gly Asn Thr He Asn Ser Asp Ala Met Ala Gin Phe Lys Glu Gin 
165 170 175 

Phe Leu Asp Arg Asp He Lys Phe Asp Ser Tyr Leu Asp Thr His Leu 
180 185 190 

Thr Ala Gin Gin Val Ser Ser Lys Glu Arg Val He Leu Lys Val Thr 
195 200 205 

Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val He 
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210 21b 220 

Leu Asn Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly lyr Met Val 
225 230 235 240 

His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu 
245 250 255 

Gin He Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He 
260 265 270 

Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala 

275 280 285 

Lys Asp Thr Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Arg 
290 295 300 

Gin Asp Tyr Lys Glu He Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser 
305 310 315 320 

Gly Asn Glu Lys Leu Asp Ala Gin He Lys Asn He Ser Asp Ala Leu 

325 330 335 

Gly Lys Lys Pro He Pro Glu Asn He Thr Val Tyr Arg Trp Cys Gly 
340 345 350 

Met Pro Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys 
355 360 365 

Asp Phe Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly Tyr 
370 375 380 

Met Ser Tbr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg 
385 390 395 400 

Lys He He Leu Arg Leu Gin Val Pro Lys Gly Ser Thr Gly Ala Tyr 
405 410 415 

Leu Ser Ala He Gly Gly Phe Ala Ser Glu Lys Glu He Leu Leu Asp 
420 425 430 

Lys Asp Ser Lys Tyr His He Asp Lys Val Thr Glu Val He He Lys 
435 440 445 

Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn 
450 455 460 

(2) INFORMATION FOR SBQ ID U0:3: 

(i) SBSUEICE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE IWE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .20 

(D) OnHER B3F0RMATIQN: /note= "Signal peptide for vacuolar 
target ting" 



(xi) SBQOBMCE DESCRIPTION: SE3Q ID N0:3: 

Ser Ser Ser Ser Phe Ala Asp Ser Am Pro He AEi" Val Ihr Asp Arg 
1 5 10 15 ^ 

Ala Ala Ser Tlhr 
20 

(2) INFORMATION PCffl SE3Q ID N0:4: 

(i) SBQUE3SICE CHARACTERISTICS: 

(A) LE^R3TH: 2655 base pairs 

(B) TYPE: nucleic acid 
(c) STEWNDEia^ss : single 
(D) TOPOLOGY: linear 

(ii) MOLEX::XJIjE type: CNA (genctnic) 

(iii) HYPOTHEraCAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) Sn^AIN: AB78 

(C) INDIVIDUAL ISOLATE: NRRL B-21058 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2652 

(D) OTHER INFORMATION: /product= "100 kDa protein VIPlA(a) * 
/note= "This sequence is identical to the portion of SBQ ID ND:1 
between and including nucleotide 2475 to 5126.* 



(xi) SBQUEJ3CE EffiSCRIPTION: SBQ ID N0:4: 

ATG AAA AAT ATG AAG AAA AAG TTA GCA AGT GTT GTA AOG TCT ACG TTA 
Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys Thr Leu 
465 470 475 

TTA OCT OCT ATG TTT TTG AAT QGA AAT GTG AAT OCT GTT TAC GCA GAC 
Leu Ala Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Tyr Ala Asp 
480 485 490 
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AGO AAA ACA AAT CAA ATT TCT ACA ACA CAG AAA AAT CAA CAG AAA GAG 144 
Ser Lys Thr Asn Gin lie Ser Thr *nir Gin Lys Asn Gin Gin Lys Glu 
496 500 505 510 

ATG GAC CGA AAA QGA TTA CTT GOG TAT TAT TTC AAA GGA AAA GAT TTT 192 
Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe 
515 520 525 . 

AGT AAT CTT ACT ATG TTT OCA COG ACA CGT GAT AGT ACT CTT ATT TAT 240 
Ser Asn Leu Thr Met Rie Ala Pro Thr Arg Asp Ser Thr Leu He Tyr 
530 535 540 

GAT CAA CAA ACA GCA AAT AAA CTA TTA -GAT AAA AAA CAA CAA GAA TAT' 288 
Asp Gin Gin Thr Ala Asn Lys Leu Leu Asp Lys Lys Gin Gin Glu Tyr 
545 550 555 

CAG TCT ATT CX3T TGG ATT GGT TIG ATT CAG AGT AAA GAA ACG GGA GAT 336 
Gin Ser He Arg Tcp He Gly Leu He Gin Ser Lys Glu Thr Gly Asp 
560 565 570 

TTC ACA TTT AAC TTA TCT GAG GAT GAA CAG GCA ATT ATA GAA ATC AAT 384 
Phe Thr Phe Asn Leu Ser Glu Asp Glu Gin Ala He He Glu He Asn 
575 580 585 590 

GOG AAA ATT ATT TCT AAT AAA QQG AAA GAA AAG CAA GIT GTC CAT TTA 432 
Gly Lys He He Ser Asn Lys Gly Lys Glu Lys Gin Val Val His Leu 
595 600 605 

GAA AAA GGA AAA TTA GTT OCA ATC AAA ATA GAG TAT CAA TCA GAT ACA 480 
Glu Lys Gly Lys Leu Val Pro He Lys He Glu Tyr Gin Ser Asp Thr 
610 615 620 

AAA TTT AAT ATT GAC AGT AAA ACA TTT AAA GAA CTT AAA TTA TTT AAA 528 
Lys Ehe Asn He Asp Ser Lys Thr £he Lys Glu Leu Lys Leu Phe Lys 
625 630 635 

ATA GAT AGT CAA AAC CAA COC CAG CAA CTC CAG CAA GAT GAA CTG AGA 576 
He Asp Ser Gin Asn Gin Pro Gin Gin Val Gin Gin Asp Glu Leu Arg 
640 645 650 

AAT CCT GAA TTT AAC AAG AAA GAA TCA CAG GAA TTC TTA GOG AAA CCA 624 
Asn Pro Glu Phe Asn Lys Lys Glu Ser Gin Glu Phe Leu Ala Lys Pro 
655 660 665 670 

TOG AAA ATA AAT CTT TTC ACT CAA AAA ATG AAA AGG GAA ATT GAT GAA 672 
Ser Lys He Asn Leu Phe Thr Gin Lys Met Lys Arg Glu He Asp Glu 
675 680 685 

GAC AOG GAT ADG GAT GGG GAC TCT ATT OCT GAC CTT TGG GAA GAA AAT 720 
Asp Thr Asp Thr Asp Gly Asp Ser He Pro Asp Leu Trp Glu Glu Asn 
690 695 700 

QQG TAT AOG ATT CAA AAT AGA ATC GCT GTA AAG TOG GAC GAT TCT CTA 768 
Gly Tyr Thr He Gin Asn Arg He Ala Val Lys Trp Asp Asp Ser Leu 
705 710 715 
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GCA AGT AM GQG TAT ACG AAA TTT CTF TCA AAT CCA CTA GAA ACT CAC 
Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His 
720 725 730 



816 



ACA GTT GGT GAT CCT TAT ACA GAT TAT GAA AAG OCA GCA AGA GAT CTA 
Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu 
735 740 745 750 



864 



GAT TTG TCA AAT OCA AAG GAA AOG TIT AAC CCA TIG GTA OCT GCT TTT 
Asp Leu Ser Asn Ala Lys Glu Thr Ph& Asn Pro Leu Val Ala Ala Kie 
755 760 765 



912 



CCA AGT GTG AAT GTT AGT ATG GAA AAG GTG ATA TTA TCA CCA AAT GAA 
Pro Ser Val Asn Val Ser Met Glu Lys Val He Leu Ser Pro Asn Glu 
770 775 780 



960 



AAT TTA TOC AAT AGT GTA GAG TCT CAT TCA TCC AOG AAT TOG TCT TAT 
Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser Ihr Asn Trp Ser Tyr 
785 790 795 



1008 



ACA AAT ACA GAA GGT GCT TCT GTT GAA GCX3 GQG ATT GGA OCA AAA GGT 
Thr Asn Thr Glu Gly Ala Ser Val Glu Ala Gly He Gly Pro Lys Gly 
800 805 810 



1056 



ATT TCG TIC GGA GTT AGC GTA AAC TAT CAA CAC TCT GAA ACA GTT GCA 
He Ser Phe Gly Val Ser Val Asn Tyr Gin His Ser Glu 'Thr Val Ala 
815 820 825 830 



1104 



CAA GAA TOG GGA ACA TCT ACA GGA AAT ACT TCG CAA TTC AAT ACG GCT 
Gin Glu Trp Gly Thr Ser Thr Gly Asn Thr Ser Gin Rie Asn Thr Ala 
835 840 845 



1152 



TCA GOG GGA TAT TTA AAT OCA AAT GTT CGA TAT AAC AAT GTA GGA ACT 
Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Oyr Asn Asn Val Gly Ihr 
850 855 860 



1200 



GGT GOC ATC TAC GAT GTA AAA CCT ACA ACA AGT TTT GTA TTA AAT AAC 
Gly Ala He Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn 
865 870 875 



1248 



GAT ACT ATC GCA ACT ATT ACG GCG AAA TCT AAT TCT ACA QCC TTA AAT 
Asp Thr He Ala Thr He Hhr Ala Lys Ser Asn Ser •nir Ala Leu Asn 
880 885 890 



1296 



ATA TCT CCT GGA GAA AGT TAC CCG AAA AAA GGA CAA AAT GGA ATC GCA 
He Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gin Asn Gly He Ala 
895 900 905 910 



1344 



ATA ACA TCA ATG GAT GAT TIT AAT TCC CAT CCG ATT ACA TTA AAT AAA 
He Hbr Ser Met Asp Asp Rie Asn Ser His Pro He Uir Leu Asn Lys 
915 920 925 



1392 



AAA CAA GTA GAT AAT CTG CTA AAT AAT AAA CCT ATG ATG TTC GAA ACA 
Lys Gin Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr 



1440 
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930 935 

AAC CAA ACA GAT GGT GTT TAT AAG ATA AAA GAT ACA CAT GGA AAT ATA 
Asn Gin Ihr Asp Gly Val lyr Lys He Lys Asp Thr His Gly Asn lie 
945 950 955 

GTA ACT GC3C QC3A GAA TOG AAT GGT GTC ATA CAA CAA ATC AAG GCT AAA 
^ Gly Gly Glu Ti^) Asn Gly Val He Gin Gin He Lys Ala Lys 
960 965 

AOV GOG TCr ATT ATT GTC GAT GAT GOG GAA CGT C?rA GCA GAA AAA CCT 
Thr Ala Ser He He Val Asp AspJS^ Glu Arg Val Ala Glu ^ S 
- 975 ^ 985 

GTA GCX5 GCA AAA GAT TAT GAA AAT CCA GAA GAT AAA ACA COG TCT TTA 
Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Ihr Pro Ser Leu 
995 1000 1005 

ACT TTA AAA GAT QCC CTG AAG err TCA TAT 03^ GAT GAA ATA AAA GAA 
Thr teu Lys Asp Ala Leu Lys Leu Ser TVr Pro Asp Glu He Lys Glu 
1010 1015 1020 

ATA GAG GGA TTA TTA TAT TAT AAA AAC AAA CTG ATA TAC GAA TOG AQC 
He Glu Gly Leu Leu IVr TVr Lys Asn Lys Pro He l^r Glu Ser 
1025 1030 1035 

^J'^JT'^'^^^^^^'^^^^GAAGIGACCAAACAA 
5^^^ Tyr Leu Asp Glu Asn -nir Ala Lys Glu Val Thr Lys Gin 
1045 1050 

TTA AAT GAT ACC ACT GGG AAA TTT AAA GAT GTA ACT CAT TPA TAT GAT 
Leu Asn llir -nir Gly Lys Hie Lys A^ Val Ser His Leu TVr Asp 
1060 1065 r^o 

GTA AAA crc ACT OCA AAA ATG AAT GTT ACA ATC AAA TTG TCT ATA err 
Val Lys Leu -mr Pro Lys Met Asn Val Thr He Lys Leu Ser He Leu 
1075 1080 1085 

TAT GAT AAT GCr GAG TCT AAT GAT AAC TCA ATT GCT AAA TX3G ACA AAC 
Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser He Gly Lys Tip Ihr Asn 
1090 1095 1100 

ACA AAT ATT GTT TCA OCT GGA AAT AAC GGA AAA AAA CAA TAT TCT TCT 
Thr Asn He val Ser Gly Gly Asn Asn Gly Lys Lys Glniyr Ser S 
1105 1110 1115 

AAT AAT COG GAT OCT AAT TIG ACA TTA AAT ACA GAT GCT CAA GAA AAA 
Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gin Glu Lys 
1120 1125 2130 

TTA AAT AAA AAT OCT GAC TAT TAT ATA ACT TTA TAT ATC AAG TCA GAA 

Leu Asn Lys Asn Arg Asp Tyr Tyr He Ser Leu Tyr Met Lys Ser Glu 
1140 1145 j^^gQ 

AAA AAC ACA CAA TCT GAG ATT ACT ATA GAT GGG GAG ATT TAT CCG ATC 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 
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Lys Asn Ite Gin Cys Glu lie Thr He Asp Gly Glu He *Vyr Pro He 
1155 1160 1165 

ACT ACA AAA ACA GTC AAT GTG AAT AAA GAC AAT TAC AAA AGA TTA GAT 2160 
•Thr Thr hys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp 
1170 1175 1180 

ATT ATA OCT CAT AAT ATA AAA AGT AAT CCA ATT TCT TCA CTT CAT ATT 2208 
He He Ala His Asn He Lys Ser Asn Pro He Ser Ser Leu His He 
1185 1190 H95 

^AAfr-TCG AAT GAT GAA ATA ACT TTA TIT TOG GAT GAT ATT TCT ATA ACA 2256 
Lys Thr Asn Asp Glu He Thr Leu Phe Trp Asp Asp He Ser He Ibr 
1200 1205 1210 

GAT GTA OCA TCA ATA AAA CCX3 GAA AAT TTA ACA GAT TCA GAA ATT AAA 2304 
Asp Val Ala Ser He Lys Pro Glu Asn Leu Thr Asp Ser Glu He Lys 
1215 1220 1225 1230 

CAG ATT TAT AGT AGG TAT GGT ATT AAG TTA GAA GAT GGA ATC CTT ATT 2352 
Gin He Tyr Ser Arg Tyr Gly He Lys Leu Glu Asp Gly He Leu He 
1235 1240 1245 

GAT AAA AAA GGT GGG ATT CAT TAT GGT GAA TTT ATT AAT GAA GCT AGT 2400 
Asp Lys Lys Gly Gly He His Tyr Gly Glu Phe He Asn Glu Ala Ser 
1250 1255 1260 

TTT AAT ATT GAA CCA TIG CAA AAT TAT GTG ACC AAA TAT GAA GTT ACT 2448 
Phe Asn He Glu Pro Leu Gin Asn Tyr Val Thr Lys Tyr Glu Val Ttir 
1265 1270 1275 

TAT AGT AGT GAG TTA GGA CCA AAC GTG AGT GAC ACA CTT GAA AGT GAT 2496 
IVr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp 
. 1280 1285 1290 

AAA ATT TAC AAG GAT GOG ACA ATT AAA TTT GAT TIT ACC AAA TAT AGT 2544 
Lys He Tyr Lys Asp Gly Thr He Lys Phe Asp Phe T!hr Lys Tyr Ser 
1295 1300 1305 1310 

AAA AAT GAA CAA GGA TTA TIT TAT GAC AGT GGA TTA AAT TGG GAC TIT 2592 
Lys Asn Glu-Gln Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe 
1315 1320 1325 

AAA ATT AAT GCT ATT ACT TAT GAT GGT AAA GAG ATS AAT C?IT TTT CAT 2640 
Lys He Asn Ala He Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His 
1330 1335 1340 



AGA TAT AAT AAA TAG 
Arg Tyr Asn Lys 
1345 



2655 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SBQUE1«:E CHARACTERISTICS: 
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(A) LEIOGTH; 884 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SBQUHJJCE DESCRIPTION: SBQ ID NO: 5: 

Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Ihr Cys Thx Leu 
15 10 15 

— -teu Ala Pro Met Phe Leu Asn Gly AaTVal Asn Ala Val Tyr Ala Asp 



Ser Lys Thr Asn Gin lie Ser Thr Thr Gin Lys Asn Gin Gin Lys Glu 
35 40 45 

Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe 
50 55 60 

Ser Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu lie Tyr 
65 70 75 80 

Asp Gin Gin Thr Ala Asn Lys Leu Leu Asp Lys Lys Gin Gin Glu Tyr 
85 90 95 

Gin Ser lie Arg T£p lie Gly Leu lie Gin Ser Lys Glu Thr Gly Asp 
100 105 110 

Phe Thr Phe Asn Leu Ser Glu Asp Glu Gin Ala lie lie Glu lie Asn 
115 120 125 

Gly Lys He He Ser Asn Lys Gly Lys Glu Lys Gin Val Val His Leu 
130 135 140 

Glu Lys Gly Lys Leu Val Pro He Lys He Glu Tyr Gin Ser Asp Ihr 
145 150 155 160 

Lys Phe Asn He Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys 
165 170 175 

He Asp Ser Gin Asn Gin Pro Gin Gin Val Gin Gin Asp Glu Leu Arg 
180 185 190 

Asn Pro Glu Phe Asn Lys Lys Glu Ser Gin Glu Kie Leu Ala Lys Pro 
195 200 205 

Ser Lys He Asn Leu E^ie Thr Gin Lys Met Lys Arg Glu He Asp Glu 
210 215 220 

Asp Thr Asp Thr Asp Gly Asp Ser He Pro Asp Leu Trp Glu Glu Asn 
225 230 235 240 

Gly Tyr Thr He Gin Asn Arg He Ala Val Lys Trp Asp Asp Ser Leu 



20 



25 



30 



245 



250 



255 
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Aia Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His 
260 265 270 

Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu 
275 280 285 

Asp Leu Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe 
290 295 300 

Pro Ser Val Asn Val Ser Met Glu Lys VaLJile Leu Ser Pro Asn Glu 
-305- 310 315 320 

Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser Ihr Asn Trp Ser Tyr 
325 330 335 

Tbr Asn Thr Glu Gly Ala Ser Val Glu Ala Gly lie Gly Pro Lys Gly 
340 345 350 

He Ser Phe Gly Val Ser Val Asn Tyr Gin His Ser Glu Thr Val Ala 
355 360 365 

Gin Glu Trp Gly Tflir Ser Ttrc Gly Asn Thr Ser Gin Ptxe Asn Thr Ala 
370 375 380 

Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr 
385 390 395 400 

Gly Ala He Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn 
405 410 415 

Asp Thr He Ala Thr He Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn 
420 425 430 

He Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gin Asn Gly He Ala 
435 440 445 

He Thr Ser Met Aqp Asp Phe Asn Ser His Pro He Thr Leu Asn Lys 
450 455 460 

Lys Gin Val-Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr 
465 470 475 480 

Asn Gin Thr Asp Gly Val Tyr Lys He Lys Asp Thr His Gly Asn He 
485 490 495 

Val Thr Gly Gly Glu Trp Asn Gly Val He Gin Gin He Lys Ala Lys 

500 505 ' 510 

Thr Ala Ser He He Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg 
515 520 525 

Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu 
530 535 540 
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Thr Leu Lys Asp Ala Leu Lys Leu Ser I'yr Pro Asp Glu lie Lys Glu 
545 550 555 560 

lie Glu Gly Leu Leu Tyr Tyr Lys Asn Lys Pro lie Tyr Glu Ser Ser 
565 570 575 

Val Met Ihr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gin 
580 585 590 

Leu Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp 
595 600 605 

Val Lys Leu TfrTPro Lys Met Asn Val Ihr Jlle Lys Leu Ser He Leu 
610 615 620 

Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser He Gly Lys Trp Thr Asn 
625 630 635 640 

Thr Asn He Val Ser Gly Gly Asn Asn Gly Lys Lys Gin Tyr Ser Ser 
645 650 655 

Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gin Glu Lys 
660 665 670 

Leu Asn Lys Asn Arg Asp Tyr Tyr He Ser Leu Tyr Met Lys Ser Glu 
675 680 685 

Lys Asn Thr Gin Cys Glu He Thr lie Asp Gly Glu He lyr Pro He 
690 695 700 

Thr Thr Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu 
705 710 715 720 

He He Ala His Asn He Lys Ser Asn Pro He Ser Ser Leu His He 
725 730 735 

Lys Thr Asn Asp Glu He Thr Leu Phe Trp Asp Asp He Ser He Thr 
740 745 750 

Asp Val Ala Ser He Lys Pro Glu Asn Leu Thr Asp Ser Glu He Lys 
755 760 765 

Gin He Tyr Ser Arg Tyr Gly He Lys Leu Glu Asp Gly He Leu He 
770 ' 775 780 

Asp Lys Lys Gly Gly He His Tyr Gly Glu Phe He Asn Glu Ala Ser 
785 790 795 800 

E^e Asn He Glu Pro Leu Gin Asn Tyr Val Thr Lys Tyr Glu Val Thr 
805 810 815 

Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp 
820 825 830 

Lys He Tyr Lys Asp Gly Thr lie Lys Phe Asp Phe Thr Lys Tyr Ser 
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835 840 84b 

Lys Asn Glu Gin Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe 
850 855 860 

Lys He Asn Ala He Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His 
865 870 875 880 

Arg Tyr Asn Lys 



(21-INFC«MATI0N FOR SBQ ID IsK>:6: 

(i) SBQUH^ CHARACTERISTICS: 

(A) UBtUTii: 2004 base pairs 

(B) TYPBi nucleic acid 

(C) STRMiSXSXJBSSi single 

(D) TOPOLOGY: linear 

(ii) MDLEXXJLE TYPE: ENA (genomic) 
(iii) HYPOIHErriCAL: NO 
(iv) ANTI-SENSE: NO 

(vi) CXIIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: AB78 

(C) INDIVIDUAL ISOLATE: NRRL B-21058 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2001 

(D) OTOER INPORMATICN: /products "80 kDa protein VIPLA{a)- 
/note= "This sequence is identical to that found in SBQ ID N0:1 
between and including nucleotide positions 3126 and 5126 



(xi) SEQUENCE DESCRIPTION: SBQ ID N0:6: 

ATG AAA AGO GAA ATT GAT GAA GAC ACG GAT ADG GAT GOG GAC TCP ATT 48 
Met Lys Arg Glu He Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser He 
885 890 895 900 

CCT GAC CTT TOG GAA GAA AAT GOG TAT ACG ATT CAA AAT AGA ATC OCT 96 
Pro Asp Leu Trp Glu Glu Asn Gly Tyr Thr He Gin Asn Arg He Ala 
905 910 915 

GTTA AAG TOG GAC GAT TCT CTA GCA AGT AAA GGG TAT AOG AAA TIT GTT 144 
Val Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe Val 
920 925 930 

TCA AAT CCA CTA GAA AGT CAC ACA GTT QCTT GAT OCT TAT ACA GAT TAT 192 
Ser Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr Thr Asp Tyr 
935 940 945 
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GAA AAG GCA GCA AGA GAT CTA GAT TTG TCA AAT GCA AAG GAA ACG TTT 240 
Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr Phe 
950 955 960 

AAC CCA TTG GTA GCT GCT TTT CCA AGT GTG AAT GTT AGT ATG GAA AAG 288 
Asn Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val Ser Met Glu Lys 
965 970 975 980 

GTG ATA TTA TCA CCA AAT GAA AAT TTA TOC AAT AGT GTA GAG TCT CAT 336 
Val lie Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His 
985 990 995 

TCA TCC AOS AAT TGG TCT TAT ACA AAT ACA GAA GGT OCT TCT' 3TT GAA 384 
Ser Ser Ttur Asn Trp Ser Tyr 'Thr Asn Thr Glu Gly Ala Ser Val Glu 
1000 1005 1010 

GOG GGG ATT GGA CCA AAA GGT ATT TOG TTC GGA GTT AGC GTA AAC TAT 432 
Ala Gly He Gly Pro Lys Gly He Ser E^e Gly Val Ser Val Asn Tyr 
1015 1020 1025 

CAA CAC TCT GAA ACA GTT GCA CAA GAA TOG GGA ACA TCT ACA GGA AAT 480 
Gin His Ser Glu Thr Val Ala Gin Glu Trp Gly Thr Ser Thr Gly Asn 
1030 1035 1040 

ACT TOG CAA TTC AAT AOG GCT TCA GOG GGA TAT TTA AAT GCA AAT GTIT 528 
Thr Ser Gin Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val 
1045 1050 1055 1060 

CGA TAT AAC AAT GTA GGA ACT GGT GCC ATC TAC GAT GTA AAA COT ACA 576 
Arg TVr Asn Asn Val Gly Thr Gly Ala lie Tyr Asp Val Lys Pro Thr 
1065 1070 1075 

ACA AGT TIT GTA TTA AAT AAC GAT ACT ATC GCA ACT ATT AO* GOG AAA 624 
Thr Ser Phe Val Leu Asn Asn Asp Thr He Ala Thr He Thr Ala Lys 
1080 1085 1090 

TCT AAT TCT ACA GCC TTA AAT ATA TCT COT GGA GAA AGT TAC COG AAA 672 
Ser Asn Ser Thr Ala Leu Asn He Ser Pro Gly Glu Ser Tyr Pro Lys 
1095 1100 1105 

AAA GGA CAA AAT GGA ATC GCA ATA ACA TCA ATG GAT GAT TTT AAT TCC 720 
Lys Gly Gin Asn Gly He Ala He Thr Ser Met Asp Asp Phe Asn Ser 
1110 1115 1120 

CAT CCG ATT ACA TTA AAT AAA AAA CAA GTA GAT AAT CTG CTA AAT AAT 768 
His Pro He Thr Leu Asn Lys Lys Gin Val Asp Asn Leu Leu Asn Asn 
1125 1130 1135 1140 

AAA CCT ATG ATG TTO GAA ACA AAC CAA ACA GAT GGT GTIT TAT AAG ATA 816 
Lys Pro Met Met Leu Glu *nir Asn Gin Thr Asp Gly Val Tyr Lys He 
1145 1150 1155 

AAA GAT ACA CAT GGA AAT ATA GTA ACT GGC GGA GAA TOG AAT GCT GTC 864 
Lys Asp Thr His Gly Asn He Val Thr Gly Gly Glu Trp Asn Gly Val 
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1160 116b 1170 

ATA CAA CAA ATC AAG GCT AAA ACA GCG TCT ATT ATT GTG GAT GAT GGG 912 
He Gin Gin He Lys Ala Lys Thr Ala Ser He He Val Asp Asp Gly 
1175 1180 1185 

GAA CX3T GTA GCA GAA AAA CGT GTA GCG GCA AAA GAT TAT GAA AAT CCA 960 
Glu Arg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro 
1190 1195 1200 

GAA GAT AAA ACA COG TCT TTA ACT TTA AAA GAT QOC CTG AAG CTT TCA 10D8 
Glu Asp Lys Tbur Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys I^^Ser 
1205 1210 1215 — 1220 

TAT CCA GAT GAA ATA AAA GAA ATA GAG GGA TTA TEA TAT TAT AAA AAC 1056 
Tyr Pro Asp Glu He Lys Glu He Glu Gly Leu Leu Tyr Tyr Lys Asn 
1225 1230 1235 

AAA CCG ATA TAG GAA TCG AGC GTT ATG ACT TAC TTA GAT GAA AAT ACA 1104 
Lys Pro He Tyr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu Asn Thr 
1240 1245 1250 

GCA AAA GAA GTG AOC TtfA CAA TTA AAT GAT ACC ACT GGG AAA TIT AAA 1152 
Ala Lys Glu Val Thr Lys Gin Leu Asn Asp Thr Tbr Gly Lys Phe Lys 
1255 - 1260 1265 

GAT GTA AGT CAT TPA TAT GAT GTA AAA CTG ACT CCA AAA ATS AAT GTT 1200 
Asp Val Ser His Leu Tyr Asp Val Lys Leu Hhr Pro Lys Met Asn Val 
1270 1275 1280 

ACA ATC AAA TIG TCT ATA CTT TAT GAT AAT GCT GAG TCT AAT GAT AAC 1248 
Thr He Lys Leu Ser He Leu Tyr Asp Asn Ala Glu Ser Asn Asp Asn 
1285 1290 1295 1300 

TCA ATT GGT AAA TOG ACA AAC ACA AAT ATT GIT TCA GCT GGA AAT AAC 1296 
Ser He Gly Lys Trp Thr Asn Thr Asn He Val Ser Gly Gly Asn Asn 
1305 1310 1315 

GGA AAA AAA CAA TAT TCT TCT AAT AAT COG GAT GCT AAT TIG ACA TTA 1344 
Gly Lys Lys Gin Tyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr Leu 
1320 1325 1330 

AAT ACA GAT GCT CAA GAA AAA TTA AAT AAA AAT OCT GAC TAT TAT ATA 1392 
Asn Thr Asp Ala Gin Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr He 
1335 1340 1345 

AGT TTA TAT ATG AAG TCA GAA AAA AAC ACA CAA TCT GAG ATT ACT ATA 1440 
Ser Leu Tyr Met Lys Ser Glu Lys Asn Thr Gin Cys Glu He Thr lie 
1350 1355 1360 

GAT GGG GAG ATT TAT CCG ATC ACT ACA AAA ACA GTG AAT GTIO AAT AAA 1488 
Asp Gly Glu He Tyr Pro He Thr Thr Lys Thr Val Asn Val Asn Lys 
1365 1370 1375 1380 

GAC AAT TAC AAA AGA TTA GAT ATT ATA GCT CAT AAT ATA AAA ACT AAT 1536 
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Asp Asn Tyr Lys Arg Leu Asp lie lie Ala His Asn He Lys Ser Asn 
1385 1390 1395 

CCA ATT TCT TCA CTT CAT ATT AAA ACG AAT GAT GAA ATA ACT.TTA TTT 1584 
Pro He Ser Ser Leu His He Lys Thr Asn Asp Glu He Thr Leu Phe 
1400 1405 1410 

TGG GAT GAT ATT TCT ATA ACA GAT GEA GCA TCA ATA AAA CCG GAA AAT 1632 
Trp Asp Asp He Ser lie Thr Asp Val Ala Ser He Lys Pro Glu Asn 
1415 1420 1425 

TPA ACA GAT TCA GAA ATT AAA CAG ATT TAT AGT AGG TAT GCT ATT AAG _ 1680- 
Leu Thr Asp Ser Glu He Lys_Gln He Tyr Sir' Arg Tyr Gly He Lys 
1430 1435 1440 

TTA GAA GAT GGA ATC CTT ATT GAT AAA AAA GGT GGG ATT CAT TAT QGT 1728 
Leu Glu A^ Gly He Leu He Asp Lys Lys Gly Gly He His Tyr Gly 
1445 1450 1455 1460 

GAA TIT ATT AAT GAA GCT AGT TTT AAT ATT GAA CCA TTG CCA AAT TAT 1776 
Glu Phe He Asn Glu Ala Ser Phe Asn He Glu Pro Leu Pro Asn Tyr 
1465 1470 1475 

GTC ACC AAA TAT GAA GIT ACT TAT AGT AGT GAG TTA GGA CCA AAC GTG 1824 
Val Thr Lys Tyr Glu Val Thr Tyr Ser Ser Glu Leu Gly Pro Asn Val 
1480 1485 1490 

AGT GAC ACA CTT GAA AGT GAT AAA ATT TAC AAG GAT GOG ACA ATT AAA 1872 
Ser Asp Thr Leu Glu Ser Asp Lys He Tyr Lys Asp Gly Thr He Lys 
1495 1500 1505 

TTT GAT TTT ACC AAA TAT AGT AAA AAT GAA CAA GGA TTA TTT TAT GAC 1920 
Phe Asp Phe Thr Lys Tyr Ser Lys Asn Glu Gin Gly Leu Phe Tyr Asp 
1510 1515 1520 

AGT GGA TTA AAT TOG GAC TIT AAA ATT AAT GCT ATT ACT TAT GAT QGT 1968 
Ser Gly Leu Asn Ttp Asp ttie Lys He Asn Ala He Thr Tyr Asp Gly 
1525 1530 1535 1540 

AAA GAG ATG AAT GTP TTT CAT AGA TAT AAT AAA TAG 2004 
Lys Glu Met Asn Val Phe His Arg Tyr Asn Lys 
1545 1550 



(2) INFORMATION FOR SBQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEf^oTH: 667 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SBQUEJ3CE DESCRIPTION: SBQ ID NO: 7: 
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Met Lys Arg Glu lie Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser lie 
15 10 15 

Pro Asp Leu Trp Glu Glu Asn Gly Tyr Thr He Gin Asn Arg He Ala 
20 25 30 

Val Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe Val 
35 40 45 

Ser Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr Thr Asp Tyr 
_ 50 55 60 

Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser AsrTAia Lys Glu Thr Phe 
65 70 75 80 

Asn Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val Ser Met Glu Lys 
85 90 95 

Val He Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His 
100 105 no 

Ser Ser Thr Asn Trp Ser Tyr Thr Asn 'Thr Glu Gly Ala Ser Val Glu 
115 120 125 

Ala Gly He Gly Pro Lys Gly He Ser Phe Gly Val Ser Val Asn Tyr 
130 135 140 

Gin His Ser Glu Thr Val Ala Gin Glu Trp Gly Tlir Ser Thr Gly Asn 
145 150 155 160 

Thr Ser Gin Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val 
165 170 175 

Arg Tyr Asn Asn Val Gly Thr Gly Ala He Tyr Asp Val Lys Pro Thr 
180 185 190 

Thr Ser Phe Val Leu Asn Asn Asp Thr He Ala Thr He Thr Ala Lys 
195 200 205 

Ser Asn Ser Thr Ala Leu Asn He Ser Pro Gly Glu Ser Tyr Pro Lys 
210 215 220 

Lys Gly Gin Asn Gly He Ala He Thr Ser Met Asp Asp Phe Asn Ser 
225 230 235 240 

His Pro He Thr Leu Asn Lys Lys Gin Val Asp Asn Leu Leu Asn Asn 
245 250 255 

Lys Pro Met Met Leu Glu Thr Asn Gin Thr Asp Gly Val Tyr Lys He 
260 265 270 

Lys Asp Thr His Gly Asn He Val Thr Gly Gly Glu Trp Asn Gly Val 
275 280 285 



He Gin Gin He Lys Ala Lys Thr Ala Ser He He Val Asp Asp Gly 
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290 295 300 

Glu Arg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro 
305 310 315 320 

Glu Asp Lys Thr Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu Ser 
325 330 335 

Tyr Pro Asp Glu He Lys Glu He Glu Gly Leu Leu Tyr Tyr Lys Asn 
340 345 350 

Lys Peo He Tyr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu AsnJIhr 
355 360 _165 

Ala Lys Glu Val TUnr Lys Gin Leu Asn Asp Thr Thr Gly Lys Phe Lys 
370 375 380 

Asp Val Ser His Leu Tyr Asp Val Lys Leu Thr Pro Lys Met Asn Val 
385 390 395 400 

Thr He Lys Leu Ser He Leu Tyr Asp Asn Ala Glu Ser Asn Asp Asn 
405 410 415 

Ser He Gly Lys Trp Thr Asn Thr Asn He Val Ser Gly Gly Asn Asn 
420 425 430 

Gly Lys Lys Gin lyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr Leu 
435 440 445 

Asn Thr Asp Ala Gin Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr He 
450 455 460 

Ser Leu Tyr Met Lys Ser Glu Lys Asn Thr Gin Cys Glu He Thr He 
465 470 475 480 

Asp Gly Glu He Tyr Pro He Thr Thr Lys Ihr Val Asn Val Asn Lys 
485 490 495 

Asp Asn Tyr Lys Arg Asp He He Ala His Asn He Lys Ser Asn 
500 505 510 

Pro He Ser Ser Leu His He Lys-Thr Asn Asp Glu He Thr Leu Phe 
515 520 525 

Ttp Asp Asp He Ser He Thr Asp Val Ala Ser He Lys Pro Glu Asn 
530 535 540 

Leu Thr Asp Ser Glu He Lys Gin He Tyr Ser Arg Tyr Gly He Lys 
545 550 555 560 

Leu Glu Asp Gly He Leu He Asp Lys Lys Gly Gly He His Tyr Gly 
565 570 575 

Glu Phe He Asn Glu Ala Ser Phe Asn He Glu Pro Leu Pro Asn Tyr 
580 585 590 
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Val Thr Lys Tyr Glu Val Thr Tyr Ser Ser Glu Leu Gly t>ro Asn Val 
595 600 605 

Ser Asp Thr Leu Glu Ser Asp Lys lie Tyr Lys A^ Gly Thr He Lys 
610 615 620 

Phe Asp Phe Thr Lys Tyr Ser Lys Asn Glu Gin Gly Leu Phe Tyr Asp 
625 630 635 640 

Ser Gly teu Asn Trp Asp Phe Lys He Asn Ala He Thr Tyr Asp Gly 

645 ^ 650 655 

Lys Glu Met Asn Val Phe His Arg Tyr Asn Lys 
660 665 

(2) INFORMATIGN FOR SE3Q ID N0:8: 

(i) SEQUENCE CHARACTEEUSTICS: 

(A) LBC^KaTXH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEENESS: single 

(D) TOPOLCXAT: linear 

(ii) MOLECULE TVTPE: peptide 
(iii) HYPCmffinCAL: 

(v) FIWSMENT TOTE: N-tenninal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN; AB78 

(C) INDIVIDUAL ISOLATE: NRRL B-21058 

(ix) FEATORE: 

(A) NAME/raY: Peptide 

(B) LOCATION: 1..16 

(D) OTHER INPDRMATICN: /note= "N-terminal sequence of 
protein purified from strain AB78" 



(xi) SBQUEICE DESCRIPTION: SBQ ID NO: 8: 

Lys Arg Glu He Asp Glu Asp Ihr Asp Ihr Asx Gly Asp Ser He Pro 
15 10 15 



(2) INPORMATIOa FOR SBQ ID N0:9: 

(i) SBQUENCTE CHARACTERISTICS: 

(A) LEN3TH; 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CNA (genomic) 
(iii) HYPOTOETICAL: NO 
(iv) AOTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: misc^feature 

(B) LOCATIOa: 1..21 

(D) OIHER 3OTORMATI0N:_/nefee= "Oligonucleotide probe based 
on amino acids 3 to 9 of SEQ ID NO: 8, using codon usage of 
Bacillus thuringiensis" 



(xi) SBQUEJICE DESCRIPTICN: SEQ ID NO: 9: 
GAAATTGATC AAGATAQJ3A T 21 
(2) INFORMATICN FOR SBQ ID N0:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENSTH: 14 amino acids 

(B) TfPE: amino acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(v) FRAGMEOT TYPE: N-temdncd 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: AB88 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATIC^J: 1..14 

(D) OTHER INFORMATION: /nDte= "N-terminal amino acid 
sequence of protein kn<Mi as anion exchange fraction 23 
(smaller) " 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 10: 

Xaa Glu Pro Phe Val Ser Ala Xaa Xaa Xaa Gin Xaa Xaa Xaa 
15 10 

(2) INFORMATION FOR SBQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
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(B) TYPE: amino acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: N-temiinal 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 



(xi) SEQJENCE DESCMPTION: SEQ ID NO: 11: 

Xaa Glu T^Glu Asn Val Glu Pro ftie Valr'Ser Ala Xaa 
1 5 10 

(2) INFORMATiai FOR SBQ ID NO: 12: 

(i) SBQUE^3CE CHARACTERISTICS: 

(A) LEN3TO: 14 amino acids 

(B) TYPE: amino acid 

(C) SIKANDECNESS: single 

(D) TOPOLOGY: N-terxninal 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thurigiensis 



(xi) SEQUENCE DESCRIPTICN: SBQ ID NO: 12: 

Met Asn Lys Asn Asn Hir Lys Leu Pro Thr Arg Ala Leu Pro 
15 10 

(2) INFORMATION FOR SBQ ID NO: 13: 

(i) SBQOaiCE CHARACTBRISnCS: 

(A) LENGriH: 15 amino acids 

(B) TYPE: amino acid 

(C) STOANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOraETICAL: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: AB88 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..15 
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(D) OTOER INFORMATIOI: /note= "N- terminal amino acid 
sequence of 35 kDa VIP active against Agrotis ipsilon" 



(xi) SEQUENCE DESCRIPTIOa: SBQ ID NO: 13: 

Ala Leu Ser Glu Asn Thr Gly Lys Asp Gly Gly lyr He Val Pro 
1 5 10 15 

(2) INFORMATION FOR SBQ ID NO: 14: 

(i) SEQUENCE CHftf»OERISTICS : — 
^"tft) I^JKTTH: 9 amino acids_^ 

(B) TCPE: amino acid 

(C) STOANCBKRSS; single 

(D) TOPOLOGY: N-terndnal 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 



(xi) SBQUH^ ASCRIPTION: SBQ ID NO: 14: 

Met Asp Asn Asn Pro Asn lie Asn Glu 
1 5 

(2) INFtDRMATION FOR SBQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: andno acid 

(C) STRPi^XMXiJESSz single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOraETICAL: NO 

(v) FRAGMENT TYPE: N-teritiinal 



(ix) FEATORE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..9 

(D) OIHER INFORMATION: /note= "N-terminal sequence of 80 
kDa delta-endotoxin" 



(xi) SEQUENCE DESCRIPTICN: SBQ ID NO: 15: 

Met Asp Asn Asn Pro Asn He Asn Glu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 16: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TVPE: amino acid 

(C) ffTOANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE; N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(ix) FEA1URE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..11 

(D) OTHER INFXaRMATION: /note= "N-tenninal sequence from 60 
kDa delta-endotoxin" 



(xi) SEQUENCE nESCRIPTICN: SBQ ID NO: 16: 

Met Asn Val Leu Asn Ser Gly Arg Thr Thr He 
15 10 

(2) INFORMATION FOR SBQ ID N0:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEH3IH: 2655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDETNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: rWA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AOTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2652 

(D) OTHER INFORMATION: /note= "Maize optimized ENA 
sequence for 100 kd VIPIA(a) protein from AB78" 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 17: 
ATGAAGAACA TGAAGAAGAA GCTGGCCAGC GTQGTSACCT GCACCCTX3CT GGCCCCCATX3 



60 
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TICCTCAAai 


GCAACX5TGAA 


CGCXX3TGTAC 


GCOSACAGCA- AGACCAACCA 


GATCAGCACC 


120 


ACCCAGAAGA 


ACCAGCAGAA 


GGAGATGGAC 


OGCAAGGQCr TQCTGQGCTA 


CTACTTCAAG 


180 


GGCAAGGACT 


TCAGCAACXrr 


GACCATGTTC 


GCCXXX:ACX3C gtcacaqcac 


CCTGATCTAC 


240 


GACCAOCAGA 


CCGCCAACAA 


GCTOCTQGAC 


AAGAAGCAGC AGGAGTACX:A 


GAGCATCOX: 


300 


TQGATCQGCC 


TCAT(XAGAG 


CAAGGAGACC 


QGOGACITCA OCTTCAACCT 


GAGOGAGGAC 


360 


GAOCAGGCTA 


TCATOGAGAT 


CAAOGGCAAG 


ATCATCAGCA ACAAGGGCAA 


GGAGAAGCAG 


420 


GTGGIGCACC 


TGGAGAAGGG 


CAAGCTGGTG 


C3CCATCAAGA TCC3AGTA0CA 


GAGC3GACACC 


480 


AAGITCAACA 


TCGACAGCAA 


GACCITCAAG 


GAGCTGAAGC TTTTCAAGAT 


CX3ACAGCCAG 


540 


AACCAQOCrC 


AGCAGGflGCA 


GCAGGAiOGAG 


CTQCGCAACC CXXSAGrTTCAA 


CAAGAAGGAG 


600 


AGCXilAGGAGT 


TCCTQQCCAA 


GCa:AGCAAG 


ATCAACCICT TCACCCAGCA 


GATGAAGOGC 


660 


GAGATCGACG 


AGGACACOGA 


CACOSAOGGC 


GACAGCATOC CXX3ACCTGTX3 


GGAGGAGAAC 


720 


GGCTACACCA 


TCCAGAACC3G 


CAHOGCXXaTC 


AAGTOQGADG ACAQCTTOGC 


TAGCAAGGGC 


780 


TACACCAACT 


TCGTGAGCAA 


CCCCCTQGAG 


AGCCACACTG TCGGOGACCC 


CTACACCGAC 


840 


TACGAGAAGG 


CCX30Ca3CX3A 


CCTGGACCTG 


AGCAAlOGCCA AGGAGAOCTT 


CAACXXXXTC 


900 


GTGQCXX3CCT 


TCCCXIAGOGr 


GAACX?IGAGC 


ATGG?^3AAGG TGAIXXTGAG 


CaXAAOGAG 


960 


aacx:tgagc::a 


ACAGOGTIGGA 


GA0C3CACTCG 


AGCAOCAACT OGAGCTACAC 


CAACACCGAG 


1020 


QGCX3CCAGCG 


TGGAGGCXXSG 


C3mX3GTCCC 


AAQQQCATCA GCTTCQOOGT 


GAGQGTCAAC 


1080 


TACCAGCACA 


GCGAGACCGT 


QGCCCAGGAG 


TQOQOCACXIA OCACXX3GCAA 


CACCAGCCAG 


1140 


TICAACACXDG 


CCAGOGCXDGG 


CTACCTGAAC 


OaCAAaSfTQC GCTACAACAA 


CGiasGCAo: 


1200 


QQCX30CATCT 


ACGAOTIGAA 


QCOCACCAOC 


AQCTTOGOQC TGAACAAC3GA 


CACCATOQCC 


1260 


ADCATCACOG 


CCAAGrrOGAA 


ITCCAOOQCX: 


CTGAACATCA GCCCGQGOGA 


GAGCTACCOC 


1320 


AAGAAGGGCC 


AGAAOQOCAT 


CXXXATCACX: 


AGCATQGAOG ACTTCAACAG 


CCACCCCATC 


1380 


ACCCTGAACA 


AGAAGCAQGT 


QGACAACCTG 


CTCAACAACA AGCCCATCAT 


GCTGGAGACC 


1440 


AACCAGACCX3 


Aa3Q0GfICrA 


CAAGATCAAG 


GACACCCAOS GCAACATCGT 


GACOQQOGQC 


1500 


GAGIGGAACG 


GOGTGATGCA 


GCAGATCAAG 


G0CAAGACCX3 CX^GOVTCAT 


CXSnXXSACGAC 


1560 


GGOSAGCXXXS 


TQGCXXAGAA 


QOQCXSTQGCXr 


GCCAAQGACr AOGAGAACXX! 


OGAGGACAAG 


1620 


ACCCOCAGCC 


TGACCCTCAA 


QGAOSCCCTG 


AAGCTGAGCT ACCXXXSACX3A 


GATCAAGGAG 


1680 


ATOGAQGGCC 


TQCTGTACTA 


CAAGAACAAG 


C3CCATCTACG AGAQCAGCGT 


GATCACCTAT 


1740 
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CTAGAOGAGA PiCPCCGCCPA GGAGGTCACC AAGCAGCTGA ACGACACCAC CXSGCAACJITC 1800 

AAGGACX3TGA GCCAOCTGTA CXSACXTTGAAG CTGACCCXXIA AGATGAACX3T GACCATCAAG 1860 

CTGAGCATCC TGTACGACAA CX3COC5AGAGC AACGACAACA QCATCQGCAA GTOGACCAAC 1920 

ACXZAACATCG TGAGCX3CXX3G CAACAACGGC AAGAAGCAGT ACAGCAGCAA CAACXXOCSAC 1980 

QCCAAOCTGA CCCTGAACAC OGAOXXXaG GAGAAGCTOA ACAAGAACCG CGACTACTAC 2040 

ATCMCX^ICT ACATCAftaC OGAGAAGAA£.AOCX»GTOro AGATCACCAT CXaeOGOGAG 2100 

ATATACCXXA TCAOCAOCAA GACCC?K3AflC GTOAACAAGG ACAACTACAA QCGCCTOGAC 2160 

ATCATCQQCC ACAACATCAA GAGCAACCCC ATCAGCAGCX: TGCACATCAA GACCAACX5AC 2220 

GAGATCACCC TCTTCTOQGA OGACATATOG ATTACCXSACG TOGOCAGCAT CAAGCCXX5AG 2280 

AACCIGAOCG ACAGCX3AGAT CAAGCAGATA TACAGTOC3CT AOSGCATCAA QCTOGAGCaC 2340 

QCXIATCCIGA TOGACAAGAA GGQOQQCATC CACTAOSGCG AGTITCATCAA CGAQQCCAGC 2400 

TTCAACATOG AGCCCCTOCA GAACTACX3TG ACCAACTACXS AQGTOACCTA CAGCAQCGftG 2460 

CTQQCXXXXA ACX3TGAQCGA CAOXTQGAG AGCXSACAAGA TTTACAAOGA C3GGCACCATC 2520 

AAGTTCGACT TCACCAAGTA CAGCAAGAAC GAGCAGQGCX: TOITCTACGA CAGCQQCXnG 2580 

AACTQQGACT TCAAGATCAA CXXTATCACC TACGACGQCA AGGAGATCAA CX?ICTIXXAC 2640 
CGCTACAACA AGTAG 

(2) INPORMATION FOR SBQ ID NO: 18: 

(i) SBQUElKrE CHARACTEEUSnCS : 

(A) LENGTH: 2004 base pairs 

(B) TVTE: nucleic acid 

(C) STOANMTNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ENA (genonic) 
liii) HYPOIHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATORE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2004 

(D) OTOER INPORMATION: /note= "Maize optiitdzed CNA 
sequence for VIPlA(a) 80 kd protein from AB78" 



2655 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 18: 
GAOSftGAACX; CXTTACAOCAT CX^^ 
T;CAO0GACr ACXSAGAAC^C OOXr^ 
^"^'^ ^^^CIOAGCAA CAGO^TOGAG^miAC^ 

-^^CAcxx^ ooxx:^ 
^^«x^Aa:A«:Ac^ 

^^OCAGCCA^^ TCAACACCX^ 

c^cco cxxKXAicTA as^^ont^ 
Aa:A.axrAocA.^cA«^T.a:Aa^ 

A0CIV«XXX:A AGAAGQOa:^ 

CAOcccATCA axrix^ 

C^CC^ GOGAGOCXXn. CmxS^ 

^^^^^ ccccc^cxxrr ^^^^ 
-^c^To..aaxx:TGcr«^Ac^^ ^^^^ 

^«^ATC ..^.SaSAGAA 
Q^CAACmc. ACX^AOCnUAG 

ACCAlTAAa: OX^TOCT C^AOaCAAC 
^«^«^OCAACA CXrAACATO^ 

^^OXXXSAC^ OCAACCK^ ^^^^ 
<^AC^ .CAOCCK^A CAOt^ 0,^^ ^^^^ 

««^^tataccxx:atcaoc^ ^^^^ 

COOCna^ TCAinxXTA CAAC.T^ ^^^^ 

^^xiAAoaoG AGATCACxrr (nixn^ ^^^^ 



780 
840 



1140 

i2Qa_ 

1260 
1320 
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AAGCCCX3AGA ACCTGACCX3A CAGCX5AGATC AAGCAGATAT ACAGTCGCTA CGGCATCAAG 1680 

CTGGAGGAO; GCATOCTGAT OGACAAGAAG QQOGQCATCC ACTAOSGX^GA C?ITCATCAAC 1740 

GAGGOCAGCT TCAACATCGA GCCrCTQCAG AACTACGTGA CCAAGTACGA C3GTCACCTAC 1800 

AGCAGCGAGC TOGGCXXCAA CGTCAGCX5AC ACCCTGGAGA GCGACAAGAT TTACAAQGAC I860 

QC3CACCATCA AGTTOSACTT CACTAAGTAC AQCAAGAACG AGCAGC3GCCT CSTTCTACGAC 1920 

AGOC5C3CXTCA ACIX3QCSAC1T CAAGATCA^. GCCATCACcf' ACGACQGCAA GGAGKIfGAAC 1980 

GTCTTOCAOC GCTACAACAA GTOG 2004 
(2) INFORMATION PCH SBQ ID ND:19: 

(i) SBQUENC:E CHARACTERISTICS: 

(A) LEWSriH: 4074 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLXXSY: linear 

(ii) MOLECULE TYPE: £NA (genondc) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1386 

(D) OraER INFOR^4ATION: /product^ "VIP2A(b) from Btt" 

(ix) FEAHIRE: 

(A) NRME/KEY: CDS 

(B) LOCATION: 1394.. 3895 

(D) OIHER INFORMATION: /prcduct= "VIPlA(b) from Btt" 

(ix) FEATORE: 

(A) NAME/KEY: inisc_feature 

(B) LOCATION: 1..4074 

(D) OTHER INFORMATION: /note= "Cloned ENA sequence from 
Btt \ahich contains the gaies for both VIPlA(b) and VIP2A(b) " 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 19: 

ATG CAA AGA ATG GAG QGA AAG TTG TTT GTG GTG TCA AAA ACA TEA CAA 48 
Met Gin Arg Met Glu Gly Lys Leu Phe Val Val Ser Lys Thr Leu Gin 
670 675 680 

GTA GTT ACT AGA ACT CTA TTG CTT AGT ACA GTT TAC TCT ATA ACT TTA 96 
Val Val Thr Arg Thr Val Leu Leu Ser Ite Val Tyr Ser He Thr Leu 
685 690 695 

TTA AAT AAT GTA GTG ATA AAA GCT GAC CAA TTA AAT ATA AAT TCT CAA 144 
Leu Asn Asn Val Val He Lys Ala Asp Gin Leu Asn He Asn Ser Gin 
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700 705 710 715 

AGT AAA TAT ACT AAC TTG CAA AAT CTA AAA ATC CCT GAT AAT GCA GAG 192 
Ser Lys TVr Thr Asn Leu Gin Asn Leu Lys lie Pro Asp Asn Ala Glu 

720 725 730 

GAT TTT AAA GAA GAT AAG GOG AAA GCX5 AAA GAA TOG GGG AAA GAG AAA 240 
Asp Phe Lys Glu Asp Lys Gly Lys Ala Lys Glu Trp Gly Lys Glu Lys 
735 740 745 

GOG GAA GAG TGG AGG CCT CCT GCT ACT GAG AAA GGA GAA ATG AAT AAT 288 
Gly Glu Glu Trp Arg Pro Pro Ala Thr Glu Lys Gly Glujtet-Asn Asn 
750 755 760 

TTT TTA GAT AAT AAA AAT GAT ATA AftG ACC AAT TAT AAA GAA ATT ACT 336 
Phe Leu Asp Asn Lys Asn Asp lie lys Thr Asn Tyr Lys Glu lie Thr 
765 770 775 

TTT TCT ATG GCA GGT TCA TGT GAA GAT GAA ATA AAA GAT TTA GAA GAA 384 
Phe Ser Met Ala Gly Ser Cys Glu Asp Glu lie Lys Asp Leu Glu Glu 
780 785 790 795 

ATT GAT AAG ATC TTT GAT AAA QCC AAT CTC TOG AGT TOT ATT ATC ACC 432 
lie Asp Lys lie Phe Asp Lys Ala Asn Leu Ser Ser Ser lie lie Thr 
800 805 810 

TAT AAA AAT GTC GAA CCA GCA ACA ATT GGA TTT AAT AAA TCT TTA ACA 480 
Tyr Lys Asn Val Glu Pro Ala Thr lie Gly Phe Asn Lys Ser Leu Ihr 
815 820 825 

GAA GGT AAT ACQ ATT AAT TCT GAT GCA ATG GCA CAG TTT AAA GAA CAA 528 
Glu Gly Asn Thr lie Asn Ser Asp Ala Met Ala Gin Phe Lys Glu Gin 
830 835 840 

TIT TTA GGT AAG GAT ATG AAG TTT GAT AGT TAT CTA GAT ACT CAT TTA 576 
Rie Leu Gly Lys Asp Met Lys Phe Asp Ser Tyr Leu Asp Thr His Leu 
845 850 855 

ACT GCT CAA CAA GTT TCC AGT AAA AAA AGA GTT ATT TTG AAG GTT ACG 624 
Thr Ala Gin Gin Val Ser Ser Lys Lys Arg Val He Leu Lys Val Thr 
860 865 870 875 

GTT OCG AGT GGG AAA GGT TCT ACT ACT CCA ACA AAA GCA GGT GTC ATT 672 
Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val He 
880 885 890 

TTA AAC AAT AAT GAA TAC AAA ATG CTC ATT GAT AAT GGG TAT GTG CTC 720 
Leu Asn Asn Asn Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Val Leu 
895 900 905 

CAT GTA GAT AAG CTA TCA AAA GTA GTA AAA AAA GOG ATG GAG TGC TTA 768 
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Met Glu Cys Leu 
910 915 920 

CAA GTT GAA GGG ACT Tl'A AAA AAG AGT CTC GAC TIT' AAA AAT GAT ATA 816 
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Gln Val Giu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He 
92b 930 935 

AAT OCT GAA GCG CAT AGC TOG GGG ATG AAA ATT TAT GAA GAC TGG GCT 864 
Asn Ala Glu Ala His Ser Trp Gly Met Lys He Tyr Glu Asp Trp Ala 
940 945 950 955 

AAA AAT TTA ACC GCT TCX5 CAA AGG GAA GCT TTA GAT GGG TAT GCT AGG 912 
Lys Asn Leu Thr Ala Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Aig 
960 965 970 

CAA GAT TAT AAA_GAA ATC AM AAT TAT TTG C:X3e-ftAT CAA QGC GGG AGT 960 

^in Asp lyr Lys^Glu He Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser 
975 980 985 

GGA AAT GAA AAG CTG GAT GOC CAA TTA AAA AAT ATT TCT GAT GCT TEA 1008 
Gly Asn Glu Lys Leu Asp Ala Gin Leu Lys Asn He Ser Asp Ala Leu 

990 995 1000 

GGG AAG AAA CCC ATA CCA GAA AAT ATT ACC GTG TAT AGA TOG TGT QGC 1056 
Gly Lys Lys Pro He Pro Glu Asn He Thr Val Tyr Arg Trp Cys Gly 
1005 1010 1015 

ATG COG GAA TIT GGT TAT CAA ATT AGT GAT CCG TTA OCT TCT TTA AAA 1104 
Met Pro Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys 
1020 1025 1030 1035 

GAT TTT GAA GAA CAA TTT TTA AAT ACA ATT AAA GAA GAC AAA GGG TAT 1152 
Asp Phe Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly Tyr 
1040 1045 1050 

ATG AGT ACA AGC TTA TOG AGT GAA OCT CTT GCA GCT TTT GGA TCT AGA 1200 
Met Ser Ihr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Axg 
1055 1060 1065 

AAA ATT ATA TTA OQC TTA CAA GTT OCXS AAA OGA AGT AOG QQG GCG TAT 1248 
Lys He He Leu Arg Leu Gin Val Pro Lys Gly Ser Thr Gly Ala Tyr 
1070 1075 1080 

TTA ACT QCC ATT GCT GGA TTT OCA ACT GAA AAA GAG ATC CTA CTT GAT 1296 
teu Ser Ala He Gly Gly Phe Ala Ser Glu Lys Glu He -Leu Leu A^ 
1085 1090 1095 

AAA GAT ACT AAA TAT CAT ATT GAT AAA GCA ACA GAG CTA ATC ATT AAA 1344 
Lys Asp Ser Lys Tyr His He Asp Lys Ala Thr Glu Val He He Lys 
1100 1105 1110 1115 

GCT GTT AAG CGA TAT CTA GTG GAT GCA ACA TTA TTA ACA AAT 1386 
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn 
1120 1125 

TAAGGAG ATG AAA AAT ATC AAG AAA AAG TTA GCA ACT GTT CTA ACC TCT 1435 
Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys 
15 10 
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ATG TTA TTA GOT CO' ATC TTV ITC AAT GGA AAT GIG AAT GCT C?rr AAC 
Met Leu I^u Ala l>ro Met Pha Leu Asn Gly Asn Val Asn SI fal 2n '''' 
" 20 25 30 

GOT GAT ACT AAA ATA AAT CAG ATT TCT ACA ACG CAG GAA AAC CAA CAG 
Ala ASP Ser Lys lie Asn Gin He Ser Oto- Ihr Gin Glu '''' 
35 40 45 

WW^GMMCGACCGAAAGGGATTArrGGGATATTATTICAAAGGAAAA 
Lys Glu Met ASP Arg Lys Gly Leu Gly TVr OVr Rie ^ 

TTT AATTSST CTT ACT ATG TTT GCA CCG ACA CCT GAT MT ACC CPT 
^ «ie Asn Asn Ihr MetPhe Ala Pro Ihr Arg A^ Asnlhr Leu 
65 70 75 

ATG TAT GAG CAA CAA ACA GOG AAT GCA TTA TTA GAT AAA AAA CAA CAA 
Met IVr A^ Gin Gin Ohr Ala .Asn Ala Leu Leu ASP ^ 

85 90 



GAA TAT CM TCC ATT OCT ATT GC?r m ATT GAG OCT A^ 

Glu Tyr Gin Ser He Arg He Gly Leu He Gin ^ 5^ ^ 

100 105 110 

?f ^ ^ ™ ^ GAA GAG GCA ATT ATA GAA 
Gly Asp Phe Thr Phe Asn I^u Ser Lys Asp Glu Gin Ala lie SI Si 
115 noft 



115 120 125 

SSS°f^?T^^?"^^''^«^^^AflGCAAGrrGIC 
He Asp Gly Lys He He Ser Asn Lys Gly lys Glu Lys Gin vS Si 

135 140 

CAT WA G?A AAA GAA AAA TTA GTT CCA ATC AAA ATA GAG TAT CAA TCA 
HIS Glu Lys Glu Lys Leu Val Pro He Lys He Glu 5^ 2^ 

150 155 



GAT AOT AAA m AAT ATT GAT ACT AAA ACA TIT AAA GAA err AAA TTA 

^ ^ Lys Phe Asn He A^ ser Lys Ihr a,e Lys £2 S 

165 170 

TTOAAAATAGATACTCAAAACCAATCrCAACAAOT-CAACTCAGAAA^ 
Phe Lys He Asp Ser Gin Asn Gin Ser Gin Gin Val 2S ^ ^ 

185 190 

OTGAATTrAACAAAAAAGAATCACAGGAATrrTTAGCAAAAQCATrA 
Pro Glu Phe Asn Lys Lys Glu Ser Gin Glu Phe S ^ 

195 200 205 

AAA Att AAC CTT TTT AAG CAA AAA ATG AAA AGA GAT ATT GAT GAA GAT 
Lys 3hr Asn Leu ae Lys Gin Lys Met Lys Arg Asp He ^ ^ 
21" 215 220 

ACG GAT ACA GAT GGA GAC TOC ATT OCT GAT err TOG GAA GAA AAT ay- 
•nu: ASP Tbr ASP Gly ASP ser He Pro ASP Leu 1^ ^ 
225 230 235 



1579 



1627 



1675 



1723 



1771 



1819 



1867 



1915 



1963 



2011 



2059 



2107 
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TAc Acx:; Arr caa aat aaa gtv got gtc aaa tog gat gat tcg cta gca 

Tyr Thr lie Gin Asn Lys Vai Aia Vai Lys Trp Asp Asp Ser Leu Ala 
240 245 250 



2155 



AGT AAG QGA TAT ACA AAA TIT GTT TCX3 AAT CCA TTA GAC AGO CAC ACA 
Ser Lys Gly Tyr Thr Lys Phe Vai Ser Asn Pro Leu Asp Ser His Thr 
255 260 265 270 

GTT QGC GAT CCC TAT ACT GAT TAT GAA AAG GOC OCA AGG GAT TTA GAT 
Val_Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala_Ala Arg Asp Leu Asp 
275 280 285 

TTA TCA AAT GCA AAG GAA AOG TIC AAC CCA TTG GTA GCT GCT TIT CCA 
Leu Ser Asn Ala Lys Glu Tte Phe Asn Pro Leu Vai Ala Ala Phe Pro 
290 295 300 

AGT GTG AAT GTT AGT ATG GAA AAG GTG ATA TTA TCA CCA AAT GAA AAT 
Ser Vai Asn Vai Ser Met Glu Lys Vai He Leu Ser Pro Asn Glu Asn 
305 310 315 



2203 



2251 



2299 



2347 



TTA TCC AAT AGT GTA GAG TCT CAT TCA TCC AOG AAT TGG TCT TAT ACG 2395 
Leu Ser Asn Ser Vai Glu Ser His Ser Ser Thr Asn Trp Ser Tyr Thr 
320 325 330 

AAT ACA GAA GGA OCT TCC ATT GAA QCTOGTOGCGGTCCATTAGGCCTr 2443 
Asn Thr Glu Gly Ma Ser He Glu Ala Gly Gly Gly Pro Leu Gly Leu 
335 340 345 350 

TCT TTT GGC GTG AGT GTT ACT TAT CAA CAC TCT GAA ACA GTT GCA CAA 2491 
Ser Phe Gly Vai Ser Vai Thr Tyr Gin His Ser Glu Thr Vai Ala Gin 
355 360 365 

GAATGGGGAACATCTACAGGAAATACTTCACAAITCAATACGGCTI^ 2539 
Glu Trp Gly 'Rir Ser Thr Gly Asn thr Ser Gin Phe Asn Thr Ala Ser 
370 375 380 

QCG GGA TAT TTA AAT OCA AAT GTT COG TAT AAC AAT GTA GOG ACT GC?r 2587 
Ala Gly Tyr Leu Asn Ala Asn Vai Arg Tyr Asn Asn Vai Gly Thr Gly 
385 390 395 

GCC ATC TAT GAT GTA AAA CCT ACA ACA AGT TTT GTA TTA AAT AAC AAT 2635 
Ala He Tyr Asp Vai Lys Pro Ihr Thr Ser Phe Vai Leu Asn Asn Asn 
400 405 410 

AOC ATC GCA ACG ATT ACA GCA AAA TCA AAT TCA ACA GCT TTA CGT ATA 2683 
Thr He Ala Thr He Thr Ala Lys Ser Asn Ser Thr Ala Leu Arg He 
415 420 425 430 

TCT CCG GOG GAT AGT TAT CCA GAA ATA GGA GAA AAC GCT ATT GCG ATT 2731 
Ser Pro Gly Asp Ser Tyr Pro Glu He Gly Glu Asn Ala He Ala He 
435 440 445 



ACA TCT ATC GAT GAT TTT AAT TCT CAT CCA ATT ACA TTA AAT AAA CAA 
Thr Ser Met Asp Asp Phe Asn Ser His Pro He Thr Leu Asn Lys Gin 



2779 
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450 455 460 

CAG GTA AAT CAA TTG ATA AAT AAT AAG CCA ATT ATG CTA GAG ACA GAC 2827 
Gin Val Asn Gin Leu lie Asn Asn Lys Pro lie Met Leu Glu Ihr Asp 
465 470 475 

CAA ACA GAT OCT GTT TAT AAA ATA AGA GAT ACA CAT QGA AAT ATT GTA 2875 
Gin Thr Asp Gly Val Tyr Lys lie Arg Asp Ihr His Gly Asn He Val 
480 485 490 

ACT GGT GGA GAA TOG AAT GGT GTA ACA CAA CAA ATT AAA GCA AAA ACA 2923 
Thr 'eiy'Gly Glu Trp Asn Gly Val Thr Gin Gin lie Lys Ala Lys Tlir 
495 300 505 510 

GCG TCT ATT ATT GTG GAT GAC GOG AAA CAG GTA GCA GAA AAA OGT GTG 2971 
Ala Ser He He Val A^ Asp Gly Lys Gin Val Ala Glu Lys Arg Val 
515 520 525 

GCG GCA AAA GAT TAT GGT CAT CCA GAA GAT AAA ACA CCA OCT TTA ACT 3019 
Ala Ala Lys Asp Tyr Gly His Pro Glu Asp Lys Tinr Pro Pro Leu Thr 
530 535 540 

TTA AAA GAT AOC CTG AAG CIT TCA TAC CCA GAT GAA ATA AAA GAA ACT 3067 
Leu I^s Asp Ihr Leu Lys Leu Ser Tyr Pro A^p Glu He Lys Glu Ttor 
545 550 555 

AAT GGA TIG TIG TAC TAT GAT GAC AAA CCA ATC TAT GAA TOG ACT GTC 3115 
Asn Gly Leu Leu Tyr Tyr Asp Asp Lys Pro He TVr Glu Ser Ser Val 
560 565 570 

ATG ACT TAT CTG GAT GAA AAT AOG OCA AAA GAA GTC AAA AAA CAA ATA 3163 
Met Thr Tyr Leu A^ Glu Asn Ihr Ala Lys Glu Val Lys Lys Gin He 
575 580 585 590 

AAT GAT ACA ACC GGA AAA TTT AAG GAT CTA AAT CAC TTA TAT GAT CTA 3211 
Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Asn His Leu lyr Asp Val 
595 600 605 

AAA CTG ACT CCA AAA ATG AAT TTT AOG ATT AAA ATG GCT TCC TIG TAT 3259 
Lys Leu Thr Pro Lys Met Asn Phe Thr He Lys Met Ala Ser Leu Tyr 
610 615 620 

GAT GGG GCT GAA AAT AAT CAT AAC TCT TTA GGA ACC TOG TAT TTA ACA 3307 
Asp Gly Ala Glu Asn Asn His Asn Ser Leu Gly Thr Trp Tyr Leu Thr 
625 630 635 

TAT AAT GTT GCT OCT GGA AAT ACT GOG AAG AGA CAA TAT OCT TCA GCT 3355 
Tyr Asn Val Ala Gly Gly Asn Thr Gly Lys Arg Gin Tyr Arg Ser Ala 
640 645 650 

CAT TCT TCT GCA CAT CTA GCT CTA TCT TCA GAA GOG AAA AAG AAA CTA 3403 
His Ser Cys Ala His Val Ala Leu Ser Ser Glu Ala Lys Lys Lys Leu 

655 660 665 670 

AAT CAA AAT GCG AAT TAC TAT CTT AGO ATG TAT ATG AAG GCT GAT TCT 3451 
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Asn Gin Asn Ala Asn Tyr 'lyr Leu Ser Met Tyr Met Lys Ala Asp Ser 
675 680 685 

ACT ACG GAA COT ACA ATA GAA GTA GCT GOG GAA AAA TCT GCA ATA ACA 3499 
Tte Hhr Glu Pro Thr lie Glu Val Ala Gly Glu Lys Ser Ala lie Thr 
690 695 700 

AGT AAA AAA GTA AAA TTA AAT AAT CAA AAT TAT CAA AGA GTT GAT ATT 3547 
Ser Lys Lys Val Lys Leu Asn Asn Gin Asn Tyr Gin Arg Val Asp lie 
705 710 715 

TTA CriG AAA AAT TCT GAA AGA AAT CCA ATG GAT AAA ATA TAT ATA AGA 3595 
Leu Val Lys Asn Ser Glu Arg Asn Pro Met Asp Lys He Tyr He Arg 
720 725 730 

QGA AAT GQC ACG ACA AAT GTT TAT GGG GAT GAT GTT ACT ATC CCA GAG 3643 
Gly Asn Gly Thr Tta: Asn Val Tyr Gly Asp Asp Val Thr lie Pro Glu 
735 740 745 750 

GTA TCA GCT ATA AAT COG GCT AGT CTA TCA GAT GAA GAA ATT CAA GAA 3691 
Val Ser Ala He Asn Pro Ala Ser Leu Ser Asp Glu Glu He Gin Glu 
755 760 765 

ATA TTT AAA GAC TCA ACT ATT GAA TAT QGA AAT OCT AGT TTC GTT GOT 3739 
He Hie Lys Asp Ser Thr He Glu Tyr Gly Asn Pro Ser Phe Val Ala 
770 775 780 

GAT GCC CTA ACA TTT AAA AAT ATA AAA COT TTA CAA AAT TAT GTA AAG 3787 
Asp Ala Val Thr Phe Lys Asn He Lys Pro Leu Gin Asn Tyr Val Lys 
785 790. 795 

GAA TAT GAA ATA TAT CAT AAA TCT CAT OGA TAT GAA AAG AAA ACG GTC 3835 
Glu Tyr Glu He Tyr His Lys Ser His Arg Tyr Glu Lys Lys Thr Val 
800 805 810 

TTT GAT ATC ATG GGT GTT CAT TAT GAG TAT AGT ATA GCT AGG GAA CAA 3883 
Phe Asp He Met Gly Val His Tyr Glu Tyr Ser He Ala Arg Glu Gin 
815 820 825 830 

AAG AAA GCC GCA TAAITITAAA AATAAAACTC GTTAGAGTrT ATrTAQCATC 3935 
Lys Lys Ala Ala 



CTArrrrrAA GAATAATCAA TATGriTCAAC OGnTOTAGC TGTnTQGAA OQGAATITCA 3995 
TnTATTTQG TCTOTTAAGT TGATGGGCAT GGGATATGTT CAGCATCCAA GCGTTTNGGG 4055 
GGTTANAAAA TCCAATTTT 4074 



(2) INFORMATiai FOR SEQ ID NO: 20: 

(i) SBQUE3^E CHARACITOISTICS : 

(A) LEN3TO: 462 amino acids 

(B) TVPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLEXULE TYPE: protein 

(xi) SEQUENCE DESCRIPTiag: SEQ ID NO: 20: 

Met Gin Arg Met Glu Gly Lys Leu Phe Val Val Ser Lys Thr Leu Gin 
15 10 15 

Val Val Thr Arg Thr Val Leu Leu Ser Thr Val Tyr Ser He Thr Leu 
20 25 30 _ 

Leu Asn Asn Val Val He Lys Ala Asp-Gin Leu Asn He Asn Ser Gin 
35 40 45 

Ser Lys lyr Thr Asn Leu Gin Asn Leu Lys He Pro Asp Asn Ala Glu 
50 55 €0 

Asp Phe Lys Glu Asp Lys Gly Lys Ala Lys Glu Trp Gly Lys Glu Lys 
65 70 75 80 

Gly Glu Glu Trp Arg Pro Pro Ala Thr Glu Lys Gly Glu Met Asn Asn 
85 90 95 

Phe Leu Asp Asn Lys Asn Asp He Lys Thr Asn Tyr Lys Glu He Ihr 
100 105 110 

Phe Ser Met Ala Gly Ser Cys Glu Asp Glu He Lys Asp Leu Glu Glu 
115 120 125 

He Asp Lys He Phe Asp Lys Ala Asn Leu Ser Ser Ser He He Thr 
130 135 140 

Tyr Lys Asn Val Glu Pro Ala Thr He Gly Phe Asn Lys Ser Leu Thr 
145 150 155 160 

Glu Gly Asn Thr He Asn Ser Asp Ala Met Ala Gin Ete Lys Glu Gin 
165 170 175 

Phe Leu Gly Lys Asp Met Lys Phe Asp Ser Tyr Leu Asp Thr His Leu 
180 185 190 

Thr Ala Gin Gin Val Ser Ser Lys Lys Arg Val He Leu Lys Val Thr 
195 200 205 

Val Pro Ser Gly Lys Gly Ser Ihr Thr Pro Thr Lys Ala Gly Val He 
210 215 220 

Leu Asn Asn Asn Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Val Leu 
225 230 235 240 

His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Met Glu Cys Leu 
245 250 255 

Gin Val Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He 
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260 ';z:r 

26.y 270 



^ Ma Clu M, „is S» ^ Sly ^ Glu ^ 

285 

Asn „, se, ^ ^ ^ ^ 

300 ^ 

=ln ASP ^ :^ OXu II, »s„ ^ ^ ^ ^ ^ ^ ^ 

320 

Oly Asn Olu ASP Al. 01„ !«. „e Sg: Asp Al. i 

335 

Oly Lys ^ ne Pro 01„ Asn Ue «^ V.1 ^ A^ ^ ^ 

350 

He. Pro ^„ Oly ^ Xl^ ^ ^ ^ ^ 

Asp Phe Glu Glu Gin Phe Leu Asn Thr He Lv« n,. a t 

370 '^ys Asp Lys Gly Tyr 

380 

Met Ser ^ ser ser Glu Ar« Ala Ala Phe Gly Ser Ar^ 

400 

lie lie uu ^ oln V.1 Pro Lys oly se. cly Al. ^ 

410 

- ^ Ala II, oly cly PHe Ala Glu Lys olu lie Leu Leu Asp 

430 

^ Lys ^ His lie A^ Lvs Ala Glu Val He He Lys 

^ 445 

Gly Arg IVr val val ASP Ala Ihr Leu Leu ihr Asn 



«5 460 



(2) INPORMATiai FXK SBQ ID N0:21: 

(i) SBQUQKTE CHARACTERISTICS: 

(A) LEN3TO: 834 amino acids 
, (B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MDLBCULE TYPE: protein 

<xi) SBQUEJCE DESCRIPTION: SBQ ID N0:21: 

Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Ihr Cys Met Leu 

10 15 

L-u Ala Pro »tet Phe Asn Gly Asn Val Asn Ala Val Asn Ala Asp 

25 30 
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Ser Lys He Asn Gin lie Ser Thr 'Thr Gin Glu Asn Gin Gin Lys Glu 
35 40 45 

Met Asp Arg Lys Gly Leu Leu Giy Tyr Tyr Phe Lys Gly Lys Asp Phe 
50 55 60 

Asn Asn Leu Hhr Met Phe Ala Pro Thr Arg Asp Asn Thr Leu Met Tyr 
65 70 75 80 

Asp Gin Gin Ihr Ala Asn Ala Leu Leu Asp Lys Lys Gin Gin Glu Tyr 
85 90 95 

Gin Ser He Arg Trp He Gly Leu He Gin Arg Lys Glu Ihr Gly Asp 
100 105 no 

Phe Thr Phe Asn Leu Ser Lys Asp Glu Gin Ala He He Glu He Asp 
115 120 125 

Gly Lys He He Ser Asn Lys Gly Lys Glu Lys Gin Val Val His Leu 
130 135 140 

Glu Lys Glu Lys Leu Val Pro He Lys He Glu Tyr Gin Ser Asp Thr 
145 150 155 

Lys Phe Asn He Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys 
165 170 175 

He Asp Ser Gin Asn Gin Ser Gin Gin Val Gin Leu Arg Asn Pro Glu 
180 185 190 

Phe Asn Lys Lys Glu Ser Gin Glu Phe Leu Ala Lys Ala Ser Lys Thr 
195 200 205 

Asn Leu Phe Lys Gin Lys Met Lys Arg Asp He Asp Glu Asp Thr Asp 
210 215 220 

Thr Asp Gly A^ Ser He Pro Asp Leu Trp Glu Glu Asn Gly TVr Thr 
225 230 235 240 

He Gin Asn Lys Val Ala Val Lys Trp Asp Asp Ser Leu Ala Ser Lys 
245 250 255 

Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Asp Ser His Thr Val Gly 
260 265 270 

Asp Pro Tyr Ohr Asp TVr Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser 
275 280 285 

Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe Pro Ser Val 
290 295 300 



Asn Val Ser Met Glu Lys Val He Leu Ser Pro Asn Glu Asn Leu Ser 
305 310 315 320 
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Asn Ser Val Glu Ser His Ser Ser Ihr Asn Trp Ser lyr Thr Asn Thr 
325 330 335 

Glu Gly Ala Ser He Glu Ala Gly Gly Giy E>ro Leu Gly Leu Ser Phe 
340 345 350 

Gly Val Ser Val Hhr Tyr Gin His Ser Glu Thr Val Ala Gin Glu Ttp 
355 360 365 

Gly Thr Ser Thr Gly Asn Thr Ser Gin Phe Asn rhr Ala Ser Ala Gly 
370 375 380 

Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gty Thr Gly AlaTlle 
385 390 395 400 

Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn Asn Thr He 
405 410 415 

Ala T!hr He Ihr Ala Lys Ser Asn Ser Thr Ala Leu Arg He Ser Pro 
420 425 430 

Gly Asp Ser lyr Pro Glu He Gly Glu Asn Ala He Ala He Thr Ser 
435 440 445 

Met Asp Asp Phe Asn Ser His Pro He Thr Leu Asn Lys Gin Gin Val 
450 455 460 

Asn Gin Leu He Asn Asn Lys Pro He Met Leu Glu Thr Asp Gin Thr 
465 470 475 480 

Asp Gly Val Tyr Lys He Arg Asp Thr His Gly Asn He Val Thr Gly 
485 490 495 

Gly Glu Trp Asn Gly Val Thr Gin Gin He Lys Ala Lys Thr Ala Ser 
500 505 510 

He He Val Asp Asp Gly Lys Gin Val Ala Glu Lys Arg Val Ala Ala 

515 520 525 

Lys Asp Tyr Gly His Pro Glu Asp Lys Thr Pro Pro Leu Thr Leu Lys 
530 535 540 

Asp Thr Leu Lys Leu Ser Tyr Pro Asp Glu He Lys Glu Thr Asn Gly 
545 550 555 560 

Leu Leu Tyr Tyr Asp Asp Lys Pro He Tyr Glu Ser Ser Val Met Thr 
565 570 575 

Tyr Asp Glu Asn Thr Ala Lys Glu Val Lys Lys Gin He Asn Asp 
580 585 590 

Thr Thr Gly Lys Phe Lys Asp Val Asn His Leu Tyr Asp Val Lys Leu 
595 600 605 



Thr Pro Lys Met Asn Phe T^u: He Lys Met Ala Ser Leu Tyr Asp Gly 
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610 615 620 

Ala Glu Asn Asn His Asn Ser Leu Gly Thr Trp Tyr Leu Thr Tyr Asn 
625 630 635 640 

Val .Ala Gly Gly Asn Thr Gly Lys Arg Gin Tyr Arg Ser Ala His Ser 
645 650 655 

Cys Ala His Val Ala L^ Ser Ser Glu Ala Lys Lys Lys Leu Asn Gin 
660 665 670 

— ' Asn Ala Asn 'Tyj^JTyr Leu Ser Met tyr Met Lys Ala Asp-^er Thr Ttor 
675" 680 685 

Glu Pro Thr He Glu Val Ala Gly Glu Lys Ser Ala He Thr Ser Lys 
690 695 700 



Lys Val Lys Leu Asn Asn Gin Asn Tyr Gin Arg Val Asp He Leu Val 
705 710 715 720 

Lys Asn Ser Glu Arg Asn Pro Met Asp Lys He Tyr He Arg Gly Asn 
725 730 735 

Gly Iter Thr Asn Val TVr Gly Asp Asp Val Thr He Pro Glu Val Ser 
740 745 750 

Ala He Asn Pro Ala Ser Leu Ser Asp Glu Glu He Gin Glu He Phe 
755 760 765 

Lys Asp Ser Thr He Glu Tyr Gly Asn Pro Ser Phe Val Ala Asp Ala 
770 775 780 

Val Thr Phe Lys Asn He I^ys Pro Leu Gin Asn Tyr Val Lys Glu TVr 
785 790 795 800 

Glu He Tyr His Lys Ser His Arg Tyr Glu Lys Lys Thr Val Phe Asp 
805 810 815 

He Met Gly Val His Tyr Glu Tyr Ser He Ala Arg Glu Gin Lys Lys 
820 825 830 

Ala Ala ' 



(2) INPOimTIC^I FOR SEQ ID N0:22: 



(i) SQQUE2CE CHARACTERISTICS: 

(A) LENGTH : 4041 base pairs 

(B) TYPE: nucleic acid 

(C) STT^ANDECNESS: single 
<D) TOPOLOGY: linear 



(ii> MQLEX::ULE TYPE: CNA (genomic) 
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(ix) FEATURK: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .4038 

product-'"" INFORMATION: /product^ »VIPlA(a)/viP2A(a) fusion 



(xi) SEQUENCE DESCRIPnOI: SBQ ID NO: 22: 

Lys Arg Met Glu Gly Lys Phe Met Val Ser lys S 

B4S- 

iff .?T 5^ AAA ACT GTA TTG C^MST ACA GPT TXCTCT ATA ICT TTA 
val val -nu: Lys tte val ser Thr Val Phe Ser lie s2 S 

855 860 

TTA AAT AAT GAA GTG ATA AAA OCT GAA CAA TTA AAT ATA AAT ICT CAA 
Asn Asn Glu Val He Lys Ala Glu Gin Leu aJ 

875 880 

ACT AAA TAT ACT AAC TTG CAA AAT CTA AAA ATC ACT GAC AAG GTA GAG 
Ser Lys TVr Thr Asnl«a Gin Asn Leu Lys lie Thr ^ S 

S!r'f^5?^°^^^°^AAAGCDGAAAGAATOGQGGAAAGAAAAA 
ASP Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys ^ 

905 

GAAAAAGAGTGGAAACTAACTGCTACTGAAAAAQGAAAAATCAATAAT 
Glu Lys Glu Trp Lys Leu Ihr Ala Ihr Glu Lys Gly ^ Z 
920 

TTT TOA GAT AAT AAA AAT GAT ATA AAG ACA AAT lAT AAA GAA ATT ACT 

Phe Leu Asp Asn Lys Asn Asp lie Lys 1^ Asn -EVr Lys Su ll^ ^ 
335 940 

TCT ATC GCA GQC TCA TIT GAA GAT GAA ATA AAA GAT TTA AAA G^ 
Phe Ser Met Ala Gly Ser Phe Glu Asp Glu He Lys Asp S ^ 

955 

ATT GAT AAG ATG-TIT C3AT AAA ADC AAT CTA TCA AAT TCT ATT ATC ACC 
lie ASP Lys Met Phe ASP Lys Ihr Asn Leu Ser Asn sS III S S 
'^5 970 975 

TAT AAA AAT GTC GAA OOG ACA ACA ATT GGA TIT AAT AAA TCT TTA ACA 
IVr Asn Val Glu Pro Tfa: Bn: lie Gly Phe Asn Lys Ser Su 5£ 

985 990 

^ ^^"^ ^ ATC GCA CAG TIT AAA GAA CAA 
Glu Gly Asn Ihr He Asn Ser Asp Ala Met Ala Gin Phe Lys Su 

1000 1005 1010 

TTT TTA GAT ACM GAT ATT AAS TTT GAT AGT TAT CTA GAT ACG CAT TTA 
Fte Asp Arg Asp lie Lys Phe Asp ser TVr Leu Asp S Ss JIu 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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1015 1020 1025 

ACT GCT CAA CAA GTT TCC ACT AAA GAA AGA GTT ATT TIG AAG GTT ACG 624 
Ttir Ala Gin Gin Val Ser Ser Lys Glu Arg Val He Leu Lys Val Ttir 
1030 1035 1040 

GTT CCG AGT QGG AAA GGT TCT ACT ACT CCA ACA AAA GCA GGT GTC ATT 672 
Val Pro Ser Gly Lys Gly Ser Ttvc Thr Pro Thr Lys Ala Gly Val He 
1045 1050 1055 

TTA AAT AAT AGT GAA TAC AAA ATG CTC AT T GAT AAT GOG TAT ATG GTC 720 
Leu^Asn: Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Met Val 
1060 1065 1070 _ 

CAT GTA GAT AAG GTA TCA AAA GTG GTG AAA AAA QGG GTS GAG TGC TTA 768 
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu 
1075 1080 1085 1090 

CAA ATT GAA GGG ACT TTA AAA AAG AGT CTT GAC TTT AAA AAT GAT ATA 816 
Gin He Glu Gly Hir Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He 
1095 1100 1105 

AAT GCT GAA GCG CAT AGC TOG GCT ATG AAG AAT TAT GAA GAG TGG OCT 864 
Asn Ala Glu Ala His Ser Ttp Gly Met Lys Asn Tyr Glu Glu Trp Ala 
1110 1115 1120 

AAA GAT TTA ACC GAT TOG CAA AQG GAA GCT TTA GAT GGG TAT GCT AQG 912 
Lys Asp Leu Thr Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Arg 
1125 1130 1135 

CAA GAT TAT AAA GAA ATC AAT AAT TAT TIA AGA AAT CAA GGC QGA ACT 960 
Gin Asp Tyr Lys Glu He Asn Asn TVr Leu Arg Asn Gin Gly Gly Ser 
1140 1145 1150 

GGA AAT GAA AAA CTA GAT GCT CAA ATA AAA AAT ATT TCT GAT GCT TTA 1008 
Gly Asn Glu Lys Leu Asp Ala Gin He Lys Asn He Ser Asp Ala Leu 
1155 1160 1165 1170 

GGG AAG AAA CCA ATA CCG GAA AAT ATT ACT GTG TAT AGA TGG TCT GGC 1056 
Gly Lys Lys Pro He Pro Glu Asn He Otar Val TVr Arg Trp Cys Gly 
1175 1180 1185 

ATG COG GAA TTT GCT TAT CAA ATT ACT GAT COG TTA OCT TCT TTA AAA 1104 
Met Pro Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys 
1190 1195 1200 

GAT TTT GAA GAA CAA TTT TTA AAT ACA ATC AAA GAA GAC AAA GGA TAT 1152 
Asp Phe Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly TVr 
1205 1210 1215 

ATG ACT ACA AGC TTA TOG ACT GAA OCT CTT OCA GCT TTT GGA TCT AGA 1200 
Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg 
1220 1225 1230 

AAA ATT ATA TTA CGA TTA CAA GTT CCG AAA GGA ACT AOG OCT GCG TAT 1248 
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Lys lie He Leu Arg Leu Gin Val Pro Lys Gly Ser Thr Giy Ala Tyr 
1235 1240 1245 1250 

TTA AGT GCC ATT GGT GGA TTT GCA ACT GAA AAA GAG ATC OTA CTT GAT 1296 
Leu Ser Ala He Gly Gly Phe Ala Ser Glu Lys Giu He Leu Leu Asp 
1255 1260 1265 

AAA GAT AGT AAA TAT CAT ATT GAT AAA GTA ACA GAG GTA ATT ATT AAA 1344 
Lys Asp Ser Lys Tyr His He Asp Lys Val Hhr Glu Val He He Lys 
1270 1275 1280 

GGT GTT AAG CGA_TAT GTA GTC^T OCA ACA TTA TTA ACA AAT ATG AAA 1392 
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn Met Lys 
1285 1290 1295 

AAT ATG AAG AAA AAG TTA GCA AGT GTT GTA ACG TGT AOG TTA TTA GOT 1440 
Asn Met Lys Lys Lys Leu Ala Ser Val Val Tlir Cys 'Vhr Leu Leu Ala 
1300 1305 1310 

COT ATG TIT TTG AAT GGA AAT CTG AAT GOT GTT TAG GCA GAC AGC AAA 1488 
Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Tyr Ala A^ Ser Lys 
1315 1320 1325 1330 

ACA AAT CAA ATT TCT ACA ACA GAG AAA AAT CAA CAG AAA GAG ATG GAC 1536 
Thr Asn Gin He Ser llir Ihr Gin Lys Asn Gin Gin Lys Glu Met Asp 
1335 1340 1345 

OGA AAA GGA TTA CTT GGG TAT TAT TIC AAA GGA AAA GAT TIT AGT AAT 1584 
Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe Ser Asn 
1350 1355 1360 

CTT ACT ATG TTT GCA CCG ACA OCT GAT ACT ACT CTT ATT TAT GAT CAA 1632 
Leu Thr Met Phe Ala Pro Ihr Arg Asp Ser Thr Leu He Tyr Asp Gin 
1365 1370 1375 

CAA ACA GCA AAT AAA CTA TTA GAT AAA AAA CAA CAA GAA TAT CAG TCT 1680 
Gin Thr Ala Asn Lys Leu Leu Asp Lys Lys Gin Gin Glu Tyr Gin Ser 
1380 1385 1390 

ATT CCT TC5G ATT GCT TTG ATT CAG AGT AAA GAA ACG GGA GAT TTC ACA 1728 
— lie Pxg Trp He Gly Leu He Gin Ser Lys Glu Thr Gly Asp Phe Thr 
1395 1400 1405 1410 

TTT AAC TTA TCT GAG GAT GAA CAG GCA ATT ATA GAA ATC AAT GOG AAA 1776 
Phe Asn Leu Ser Glu Asp Glu Gin Ala He He Glu He Asn Gly Lys 
1415 1420 1425 

ATT ATT TCT AAT AAA GOG AAA GAA AAG CAA GTT GTC CAT TTA GAA AAA 1824 
He He Ser Asn Lys Gly Lys Glu Lys Gin Val Val His Leu Glu Lys 
1430 1435 1440 

GGA AAA TTA GTT CCA ATC AAA ATA GAG TAT CAA TCA GAT ACA AAA TTT 1872 
Gly Lys Leu Val Pro He Lys He Glu Tyr Gin Ser Asp Thr Lys Phe 
1445 1450 1455 
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AAT ATT GAC AGT AAA ACA TIT AAA GAA CTT AAA TTA TIT AAA ATA GAT 1920 
Asn lie Asp Ser Lys Thr Phe Lys Glu Lieu Lys Leu Phe Lys lie Asp 
1460 1465 1470 

AGT CAA AAC CAA CCC CAG CAA GTC CAG CAA GAT GAA CTG AGA AAT OCT 1968 
Ser Gin Asn Gin Pro Gin Gin Val Gin Gin Asp Glu Leu Arg Asn Pro 
1475 1480 1485 1490 

GAA TTT AAC AAG AAA GAA TCA CAG GAA TVC TTA GCG AAA CCA TCG AAA 2016 
Glu £4ie Asn Lys Lys Glu Ser Gin Glu Phe Leu Ala Lys Pro Ser Lys 
1495 1500 1505 

ATA AAT CTt'tTC ACT CAA AAA ATG AAA AGG GAA ATT GAT GAA GAC AOG 2064 
He Asn Leu Phe Thr Gin Lys Met Lys Arg Glu He Asp Glu Asp Thr 
1510 1515 1520 

GAT AOG GAT GOG GAC TCT ATT OCT GAC CTT TGG GAA GAA AAT GGG TAT 2112 
Asp Asp Gly Asp Ser He Pro Asp Leu Trp Glu Glu Asn Gly Tyr 
1525 1530 1535 

ACG ATT CAA AAT AGA ATC OCT GTA AAG TOG GAC GAT TCT OTA GCA AGT 2160 
Tftir He Gin Asn Arg He Ala Val Lys Trp Asp Asp Ser Leu Ala Ser 
' 1540 1545 1550 

AAA GGG TAT ACG AAA TTT GTT TCA AAT OCA CTA GAA AGT CAC ACA GTT 2208 
Lys Gly lyr Hhr Lys Phe Val Ser Asn Pro Leu Glu Ser His Ihr Val 
1555 1560 1565 1570 

GGT GAT CCT TAT ACA GAT TAT GAA AAG GCA GCA AGA GAT CTA GAT TTG 2256 
Gly Asp Pro lyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu Asp Leu 
1575 1580 1585 

TCA AAT OCA AAG GAA ACG TTT AAC CCA TTG GTA OCT GOT TIT CCA AGT 2304 
Ser Asn Ala Lys Glu Ihr Phe Asn Pro Leu Val Ala Ala Phe Pro Ser 
1590 1595 1600 

GTG AAT GTT AGT ATG GAA AAG GTG ATA TTA TCA CCA AAT GAA AAT TTA 2352 
Val Asn Val Ser Met Glu Lys Val He Leu Ser Pro Asn Glu Asn Leu 
1605 1610 1615 

TCC AAT AGT GTA GAG TCT CAT TCA TCC AOG AAT TOG TCT TAT ACA AAT 2400 
Ser Asn Ser Val Glu Ser His Ser Ser Ihr Asn Trp Ser Tyr Ihr Asn 
1620 1625 1630 

ACA GAA GGT OCT TCT GTT GAA GCG GGG ATT GGA OCA AAA GGT ATT TOG 2448 
Thr Glu Gly Ala Ser Val Glu Ala Gly He Gly Pro Lys Gly He Ser 
1635 1640 1645 1650 

TTC GGA GTT AGC GTA AAC TAT CAA CAC TCT GAA ACA GTIT GCA CAA GAA 2496 
Phe Gly Val Ser Val Asn Tyr Gin His Ser Glu Thr Val Ala Gin Glu 
1655 1660 1665 

TGG GGA ACA TCT ACA GGA AAT ACT TCG CAA TTC AAT ACG OCT TCA GCG 2544 
Trp Gly Thr Ser Thr Gly Asn Thr Ser Gin E^e Asn Thr Ala Ser Ala 
1670 1675 1680 
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GGA TAT TTA AAT GCA AAT GTT CX5A TAT AAC AAT GTA GGA ACT GGT GCC 2592 
Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr Gly Ala 
1685 1690 1695 

ATC TAC GAT GTA AAA CCT ACA ACA AGT TTT GTA TTA AAT AAC GAT ACT 2640 
lie Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn Asp Hir 
1700 1705 1710 

ATC GCA ACT ATT ACG GOG AAA TCT AAT TCT ACA GCC TTA AAT ATA TCT 2688 
lie. Ala Thr lie Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn lie Ser 
1715 1720 1725 1730 

CCT GGA GAA AGT TAC COG AAA AAA GGA CAA AAT GGA ATC GCA ATA ACA 2736 
Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gin Asn Gly lie Ala lie Thr 
1735 1740 1745 

TCA ATG GAT GAT TTT AAT TOC CAT CCG ATT ACA TTA AAT AAA AAA CAA 2784 
Ser Met Asp Asp Phe Asn Ser His Pro lie Thr Leu Asn Lys Lys Gin 
1750 1755 . 1760 

GTA GAT AAT CTG CTA AAT AAT AAA CCT ATG ATG TTG GAA ACA AAC CAA 2832 
Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr Asn Gin 
1765 1770 1775 

ACA GAT GGT GTT TAT AAG ATA AAA GAT ACA CAT GGA AAT ATA GTA ACT 2880 
Thr Asp Gly Val TVr Lys lie Lys Asp Thr His Gly Asn lie Val Thr 
1780 1785 1790 

GGC GGA GAA TGG AAT OCT GTTC ATA CAA CAA ATC AAG GOT AAA ACA GOG 2928 
Gly Gly Glu Trp Asn Gly Val He Gin Gin He Lys Ala Lys Thr Ala 
1795 1800 1805 1810 

TCT ATT ATT GTG GAT GAT GOG GAA CGT GTA GCA GAA AAA OGT GTA GCG 2976 
Ser He He Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg Val Ala 
1815 1820 1825 

GCA AAA GAT TAT GAA AAT CCA GAA GAT AAA ACA CCG TCT TTA ACT TTA 3024 
Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu Ihr Leu 
1830 1835 1840 

AAA GAT GCC CTG AAG CTT TCA TAT CCA GAT GAA ATA AAA GAA ATA GAG 3072 
Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu He Lys Glu He Glu 
1845 1850 1855 

GGA TTA TTA TAT TAT AAA AAC AAA COG ATA TAC GAA TOG AGO GTIT ATG 3120 
Gly Leu Leu Tyr Tyr Lys Asn Lys Pro He Tyr Glu Ser Ser Val Met 
1860 1865 1870 

ACT TAC TTA GAT GAA AAT ACA GCA AAA GAA GTG AOC AAA CAA TTA AAT 3168 
Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gin Leu Asn 
1875 1880 1885 1890 

GAT ACC ACT GOG AAA TTT AAA GAT GTA AGT CAT TTA TAT GAT GTA AAA 3216 
Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu lyr Asp Val Lys 
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1895 



1900 



1905 



CTG ACT CCA AM ATC AAT ACA ATC AM TIG OCT ATA CTT TAT r^T 
Leu Thr Pro Lys^M^t Asn Val He^Lys Leu He Z 

MT GCT GAG TCr MT GAT AACTCA ATT GC?r AM TCG ACA Aar apa 
Asn Ala Olu^ser Asn Asp Asn ^^^Ue Cly ?S £ ^ S JIS! 

?II ST ^ ^ "^"^ «^ ^ ^ CM TAT TCT TCT MT AAT 

1945 1950 

ore GAT GCT MT TTG ACA TTA MT ACA GAT GCT CM GM AM TTA AAT 

Pro^Asp Ala Asn Leu Asn ASP Ala Si S 

1965 197Q 

^^^5 2000 

ZDIQ 2015 
GCT CAT MT ATA AAA AGT AAT CXIA ATT TT^ TY-a r-m. 
Ala His Asn lie Lys Ser Asn JII S S 2! SI J?' 
2020 onot; ^ Lys Thr 

^"^^ 2030 

AAT GAT GAA 
Asn Asp Glu 
2035 



CSCA TCA ATA 
Ala Ser He 



TAT AGT AGG 
lyr Ser Arg 



AAA GGT GGG 
Lys Gly Gly 
2085 



ATA ACT TTA TIT TOG GAT GAT ATT TCP ATA ACA GAT OTA 
^ 2'^o''^" Asp ASP lie ser Ili ^ ^ Z 
^^^^ 2045 ^ 2050 

AAA era GAA AAT TTA ACA GAT TCA GAA ATT AAA 
^ ^ Olu ^^■n.^ser Glu He ^ S 

^050 2065 

TAT OCT ATT AAG TTA GM GAT GGA ATC err ATT GAT aaa 

^ ne 0X„ MP <ay S 1^ 

2075 2080 ' 

ATT CAT TAT GGT GM TIT ATT MT GM GOT 'A£?r TIT MT 
lie Hxs tVr Gly Glu Phe He Asn Glu Ala Ser ^ 
2090 2095 



S ^ ^ AAT TAT GTS ACT AM TAT GM GTT ACT TAT ACT 

lie Glu^Pro Leu Gin Asn IVr^Val ^ Tyr Glu S S 5^ S 

ACT GAG TTA GGA CCA AAC GTC AGT GAC ACA CTT GM AC?I> GAT AM ATT 



3264 



3312 



3360 



3408 



3456 



3504 



3552 



3600 



3648 



3696 



3744 



3792 



3840 



3888 
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Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp Lys lie 
2115 2120 2125 2130 

TAG AAG GAT GOG ACA ATT AAA TTT GAT TTT ACC AAA TAT AGT AAA AAT 3936 
lyr Lys Asp Gly Thr lie Lys Phe Asp Phe Tlir Lys Tyr Ser Lys Asn 
2135 2140 2145 

GAA CAA QGA TTA TTT TAT GAG AGT GGA TTA AAT TGG GAC TTT AAA ATT 3984 
Glu Gin Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe Lys He 
2150 2155 2160 

AAT OCT ATT ACT TAT GAT GGT AAA GAG ATG AAT GTT TTT CWP-AGA TAT 4032 
Asn Ala He Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His Arg Tyr 
2165 2170 2175 

AAT AAA TAG 4941 
Asn Lys 
2180 



(2) INFORMATION FOR SBQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENSra: 1346 andno acids 

(B) TVTPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:23: 

Met Lys Arg Met Glu Gly Lys Leu Ete Met Val Ser Lys Lys Leu Gin 

1 .5. 10 15 

Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser He Ser Leu 

20 25 30 

Leu Asn Asn Glu Val He Lys Ala Glu Gin Leu Asn He Asn Ser Gin 
35 40 45 

Ser Lys Tyr Thr Asn Leu Gin Asn Leu Lys lie Thr Asp Lys Val Glu 
50 55 60 

Asp E^e Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys 
65 70 75 80 

Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn 
85 90 95 

E*ie Leu Asp Asn Lys Asn Asp He Lys Tiir Asn Tyr Lys Glu He Thr 

100 105 110 

Phe Ser Met Ala Gly Ser Phe Glu Asp Glu He Lys Asp Leu Lys Glu 
115 120 125 
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lie Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser lie lie Thr 
130 135 140 

Tyr Lys Asn Val Glu Pro Thr Thr lie Gly Phe Asn Lys Ser Leu Ihr 
145 150 155 160 

Glu Gly Asn Thr He Asn Ser Asp Ala Met Ala Gin Phe Lys Glu Gin 
165 170 175 

Phe h&a Asp Arg Asp lie Lys Phe Asp Ser Tyr Leu Asp Thr His Leu 
180 185 190 

Thr Ala Gin Gin Val Ser Ser Lys Glu Arg Vat-He Leu Lys V^l Ihr 
195 200 205 

Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val lie 
210 215 220 

Leu Asn Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Met Val 
225 230 235 240 

His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu 
245 250 255 

Gin He Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He 
260 265 270 

Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala 
275 280 285 

Lys Asp Leu Thr Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Arg 
290 295 300 

Gin Asp Tyr Lys Glu He Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser 
305 310 315 320 

Gly Asn Glu Lys Leu Asp Ala Gin He Lys Asn He Ser Asp Ala Leu 
325 330 335 

Gly Lys Lys Pro He Pro Glu Asn He Thr Val Tyr Arg Trp Cys Gly 
340 345 350 

Met Pro Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys 
355 360 365 

Asp Phe Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly TVr 
370 375 380 

Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg 
385 . 390 395 . 400 

Lys He He Leu Arg Leu Gin Val Pro Lys Gly Ser Thr Gly Ala Tyr 
405 410 415 

Leu Ser Ala He Gly Gly Phe Ala Ser Glu Lys Glu He Leu Leu Asp 
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420 



A21 



430 



Lys Asp Ser Lys Tyr His lie Asp Lys Val Thr Glu Val He He Lvs 
435 440 



445 



Gly yal Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn Met Lys 

450 Atzc . 



455 



460 



Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr C/s Thr Leu Leu Ala 

Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Tyr Ala Asp Ser Lys 
485 490 

Uir Asn Gin lie Ser Thr mir Gin Lys Asn Gin Gin Lys Glu Met Asd 
500 505 510 

Arg Lys Gly Leu Leu Gly T/r Tyr Phe Lys Gly Lys Asp Phe Ser Asn 
515 520 525 

Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu He Tyr Asp Gin 

530 535 

Gin Thr Ala Asn Lys Leu Leu Asp Lys Lys Gin Gin Glu Tyr Gin Ser 



550 



555 



560 



He Arg Trp He Gly Leu He Gin Ser Lys Glu Ihr Gly Asp Phe Thr 
565 570 



575 



Phe Asn Leu Ser Glu Asp Glu Gin Ala He He Glu He Asn Gly Lys 
580 585 590 

He He Ser Asn Lys Gly Lys Glu Lys Gin Val Val His Leu Glu Lys 
595 600 605 

Gly Lys Leu Val Pro He Lys He Glu TVr Gin Ser Asp Thr Lys Phe 
610 615 620 

Asn He Asp Ser Lys Thr Phe Lys Glu Leu lys Leu Phe Lys He Asp 



635 



640 



Ser Gin Asn Gin Pro Gin Gin Val Gin Gin Asp Glu Leu Arg Asn Pro 
645 650 655 

Glu Phe Asn Lys Lys Glu Ser Gin Glu Phe Leu Ala Lys Pro Ser Lvs 

665 670 

He Asn Leu Phe Ihr Gin Lys Met Lys Arg Glu He Asp Glu Asp Thr 
675 680 685 

Asp Thr Asp Gly .Asp Ser He Pro Asp Leu Trp Glu Glu Asn Glv Tvr 
690 695 700 

Ihr He Gin Asn Arg He Ala Val Lys Trp Asp Asp Ser Leu Ala Ser 
710 715 



wo 97/46105 



PCT/EP97/02737 



81 - 



Lys Gly lyr I'hr Lys Phe Val Ser Asn Pro Leu Glu Ser His Thr Val 
725 730 735 

Gly Asp Pro Tyr Ihr Asp Tyr Glu Lys Ala Ala Arg Asp Leu Asp Leu 
740 745 750 

Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe Pro Ser 
755 760 765 

Val Asn Val Ser Met Glu Lys Val lie Le u Se r Pro Asn Glu Asn Leu 
_ -^m 775 780 

Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr Thr Asn 
785 790 795 800 

Thr Glu Gly Ala Ser Val Glu Ala Gly He Gly Pro Lys Gly He Ser 
805 810 815 

Phe Gly Val Ser Val Asn Tyr Gin His Ser Glu 'Vhr Val Ala Gin Glu 
820 825 830 

Trp Gly Thr Ser Thr Gly Asn Thr Ser Gin Phe Asn Thr Ala Ser Ala 
835 840 845 

Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr Gly Ala 
850 855 860 

He Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn Asp Thr 
865 870 875 880 

He Ala Thr He Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn He Ser 

885 890 895 

Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gin Asn Gly He Ala He Thr 
900 905 910 

Ser Met Asp Asp E^e Asn Ser His Pro He Thr Leu Asn Lys Lys Gin 
915 920 925. 

Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr Asn Gin 
930 935 940 

The Asp Gly Val Tyr Lys He Lys Asp Thr His Gly Asn He Val Thr 
945 950 955 960 

Gly Gly Glu Trp Asn Gly Val He Gin Gin He Lys Ala Lys Thr Ala 
965 970 975 

Ser He He Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg Val Ala 
980 985 990 



Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu Thr Leu 
995 1000 1005 
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Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu lie Lys Glu He Glu 
1010 1015 1020 

Gly Leu Leu lyr Tyr Lys Asn Lys Pro He Tyr Glu Ser Ser Val Met 
1025 1030 1035 1040 

Ihr Tyr Leu Asp Glu Asn Ihr Ala Lys Glu Val Thr Lys Gin Leu Asn 
1045 1050 1055 

Asp Thr Ttor Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp Val Lys 
' 1060 1065 1070 

Leu Ihr Pro Lys Met Asn Val Thr He Lys Leu Ser He Leu Tyr Asp 
1075 1080 1085 

Asn Ala Glu Ser Asn Asp Asn Ser He Gly Lys Trp Thr Asn Thr Asn 
1090 1095 1100 

He Val Ser Gly Gly Asn Asn Gly Lys Lys Gin Tyr Ser Ser Asn Asn 
1105 1110 1115 1120 

Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gin Glu Lys Leu Asn 
1125 1130 1135 

Lys Asn Arg Asp Tyr T^r He Ser Leu Tyr Met Lys Ser Glu Lys Asn 
1140 1145 1150 

Thr Gin Cys Glu He Thr He Asp Gly Glu He Tyr Pro He Thr Thr 
1155 1160 1165 

Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp He He 
1170 1175 1180 

Ala His Asn He Lys Ser Asn Pro He Ser Ser Leu His He Lys Ttir 
1185 1190 . . 1195 1200 

Asn Asp Glu He Thr Leu Phe Trp Asp Asp He Ser He Thr Asp Val 
1205 1210 1215 

Ala Ser He Lys Pro Glu Asn Leu Thr Asp Ser Glu He Lys Gin He 
1220 1225 1230 

Tyr Ser Arg Tyr Gly He Lys Leu Glu Asp Gly He Leu He Asp Lys 
1235 1240 1245 

Lys Gly Gly He His Tyr Gly Glu Phe He Asn Glu Ala Ser Phe Asn 
1250 1255 1260 

He Glu Pro Leu Gin Asn Tyr Val Thr Lys Tyr Glu Val Tlir Tyr Ser 
1265 1270 1275 1280 

Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp Lys He 
1285 1290 1295 

Tyr Lys Asp Gly Thr He Lys Phe Asp Phe Thr Lys Tyr Ser Lys Asn 
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1300 1305 1310 

Glu Gin Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe Lys He 
1315 1320 1325 

Asn Ala He Thr Tyr Asp Gly Lys Glu Jfet Asn Val Phe His Arg Tyr 
1330 1335 1340 

Asn Lys 
1345 

(2r-INP0RMATI0N FOR SBQ ID NO: 24: 

(i) SEXJUHMCE CHARACITOISTICS: 

(A) LENGTH: 1399 base pairs 

(B) TOPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TOPE: ENA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: inisc_feature 

(B) LOCATION: 1..1386 

(D) OTHER INFCM1ATICN: /note=: "Maize cptijnized EMA 
sequence for VIP2A{a) protein from AB78" 



(xi) SEQUENCE DESGRIPTIOI: SEQ ID N0:24: 



ATGAAGCQCA TGGAGQQCAA GCTGrTTCATG GrTGAGCAAGA AGCTOCAGGT 


GGTGAOCAAG 


60 


ACCGnxscrcc tgagcacogt gttcagcatc aqoctqctga acaaogaqgt 


GATCAAOGCC 


120 


GAQCAGCTGA ACATCAACAG CCAGAGCAAG TACAOCAAOC TCCAGAACCT 


GAAGATCACC 


180 


GACAAOGTGG AGGACTICAA GGAGGACAAG GAGAAQQCCA AQGAGTQGGG 


CAAOGAGAAG 


240 


GAGAAGGAGT QGAAGCITAC OGCCACOGAG AAGQGCAAGA TGAACAACTT 


CCTQGACAAC 


300 


AAGAACGACA TCAAGACCAA CTACAAQGaG ATCACCTTCA GCATGX3C0GG 


GAGCTTOGAG 


360 


GACGAGATCA AOGAOCTGAA GGAGATQGAC AAGATGTTOG ACAAGAC!CAA 


CCIGAOCAAC 


420 


AGCATCATCA OCTACAAGAA OGTGGAGCCC AOCACCATCG GCTTCAACAA 


GAGOCTGACC 


480 


GAGGQCAACA CCATCAACAG CGACGCCATG QCCCAGTrCA AGGAGCAGTT 


CCIGGACOGC 


540 


GACATCAAGT TOGACAGCTA CCTQGACACC CAOCTGAOCG OXIAQCAGGT 


GAOCAOCAAG 


600 


GAGCGCGTGA TCCTGAAQGT GACOG?rOOOC AGOQOCAAOG GCAGCACCAC 


OOCCWTAAG 


660 


QCCQGQGTCA T0CTC5AACAA CAOCGAGTAC AAGATGCTGA TOGACAAOQG 


CTACATQGTG 


720 
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ArYTprj Ar^p A A 




AAPPPPPfTYTP RnTV^nmv>nA nTvov^^Arv^r^/^ 


780 


^fYY^'Vnh. Zip A 


upAryppTAi"^ 


Ui lWv»iAAL 


UALAl\.i\ALu (JCGACSGCXXA CAGCTQGGGC 


840 




ALAjin^Mflja Ivr 


Uo<JL.AAi<^jiAL. 


C^iXaAUUtaALA GCCAGOQuGA GGCCCTOGAC 


900 






Pa A^y^ An ATIT' 

CAALAdAlaAi\« 


AACAACITvuC TSOGCAACCA GGGCX3GCAGC 


960 




Auv» l\AiA\JviC 


/^APATP A AP 


AALAiWVjCAa ACQCXCiUUG CAAGAAOOCC 


1020 


AIvUCUbAtaA 


A!UA\L\JAUL\j 1 


OTADUGCTQG 


TGOQGCATGC CCGAGTITCGG CTACCAGATC 


1080 


AGCS3ACCCCC 


TGCOCAQCCT 


GAAQGACTTC 


GAQGAGCA(3T TCCTGAACAC CATCAAGGAG 


1140 


GACAAGQGCT 


ACATGAQCAC 


CAGOCTGAGC 


AQCGACXX3CX: TOGCXXSCXHT CXXXAQC30QC 


1200 


AAGATCATCJC 


TGOGCCTQCA 


GGTIGCXXAAG 


QGCAGCA0CX3 GOGCCTACCT GAOCOCXMC 


1260 


GGOGQCTTCG 


CCAGCX^AGAA 


GGAGATOCTC 


CTQGACAAQG ACAGCAAGTA CCACATOGAC 


1320 


AAGGTGACCG 


AGGTGATCAT 


CAAQGGOGTO 


AAGOGCTADG TOGTCGACGC CACCCTOCTG 


1380 


ACCAACTAGA 


TCTGAC3CTC 






1399 



(2) INFORMATION FOR SBQ ID NO:25: 

(i) SE3QUENCE CHARACTERISTICS: 

(A) LEN3TH: 19 amino acids 

(B) TYPE: amino acid 

(C) SrniANDEIXlESS: single 

(D) TOPOLCGy: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATORE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..19 

(D) aniER INFORMATION: /note= "Secretion signal peptide to 
secrete VIP2 out of a cell" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Gly Trp Ser Trp He Phe Leu Hie L«i Leu Ser Gly Ala Ala Gly Val 
1 5 10 15 

His Cys Leu 



(2) INPORMATIC^J FOR SBQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2655 base pairs 

(B) TYPE: nucleic acid 
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(C) STR?\NDEDNESS : single 
(D> TOPOIjOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTIC3N: /desc = "Synthetic ENA" 

(iii) HYPOTHBTICAL: NO 



(ix) FEATORE: 

(A) NAME/KEY: itdsc_featiire 

(B) LOCATION: 1..2655 

(D) OTHER INPORMATICN: /note= "maize pptiitiized DMA 
sequence encoding VIPlA(a) " 



(xi) SEfJJENCE DESCRIPTION: SBQ ID NO: 26 






ATGAAGAACA 


TGAAGAAGAA 


GCTQQCCAQC 


GTOGTGACCT 


GCAOCTTGCT GGCOCCCATG 


60 


TTCCTGAACX; 


OCAACGTGAA 


0GC0GTC3TAC 


QOOGACAQCA 


AGACCAAOCA GATCAGCACC 


120 


AOCCAGAAGA 


AOCAOCAGAA 


GGAGATQGAC 


aSCAAQGQOC 


TQCTQQQCTA CTACTICAAG 


180 


QQCAAQGACT 


TCAQCAAOCT 


GACCATGITC 


QCXXX3CA0QC 


GTCACAQCAC CCTCATCTAC 


240 


GAOCAGCAGA 


CCX3CCAACAA 


GCTGCTQGAC 


AAGAAGCAGC 


AGGAGTACCA GAGCATCCGC 


300 


TGGATOGGCC 


TGATCCAGAG 


CAAGGAGACC 


GQCX3ACTTCA 


CCTTCAACCT GAGOGAGGAC 


360 


GAGCAQQQCA 


TCATOGAGAT 


CAAOGGCAAG 


ATCATCAGCA 


ACAAGGQCAA GGAGAAGCAG 


420 


GK5GTQCA0C 


TGGAGAAGGG 


CAAGCTGGfTG 


OCCATCAAGA 


TOQAGTAOCA GAGCGACAOC 


480 


AAGTTCAACA 


TOGACAGCAA 


GACCnCAAG 


GAOCTGAAOC 


TTTTCAAGAT OSACAGCCAG 


540 


AACX^GCCCX: 


AGCAGGTOCA 


GCAGGADGAG 


CTGCGCAACC 


CCGAGTICAA CAAGAAGGAG 


600 


AGQCAGGAGT 


TCCTQQOCAA 


GCCCAGCAAG 


ATCAACCTGT 


TCACXX^GCA GATGAAGOQC 


660 


GAGATCGA06 


AGGACACXXSA 


CACOGAOGGC 


GACAGCATCC 


CXX3A0CTGTG GGAGGAGAAC 


720 


GQCTACACCA 


TQCA«3AA0CG 


CATOQCXXTTG 


AAC?ia3GACG 


ACAGOCTQGC TAOCAAQQQC 


780 


TACACCAAGT 


TOGTGMSCAA 


CXXCCTQGAG 


AGCCACACCG 


TQQGOGACCC CTACACCGAC 


840 


TACGAQAAGG 


COQCCOQOGA 


CCTQGACCTG 


AGCAAOSCCA 


AGGAGAOCTT CAACCCCCTG 


900 


GTGQCCGOCr 


TCCCCAQOCT 


GAAiGGTGAGC 


ATGGAGAAGG 


TGATCCIGAG CCCTAAOGAG 


960 


AAOCTCAGCA 


ACAOOGTGGA 


GAGOCACTOG 


AGCAOCAACT 


QGAGCTACAC CAACA0C3GAG 


1020 


GQCGCCAGOG 


TQGAQGCOQG 


CAT0C3GTCCC 


AAGQGCATCA 


GCTTCOGOGT GAGCGTGAAC 


1080 


TACCAOCACA 


GCGAGAOOGT 


GGCCCAOGAG 


TQOQGCAOCA 


GCACOGGCAA CACCAGCCAG 


1140 



wo 97/46105 



PCT/EP97/02737 



■86- 

TTCAACACCX3 CCAGCQXQG CTACCTGAAC GCCAACGTQC GCTACAACAA CGTCQC3CACC 1200 

QGCGCCATCT AC3GACGTCAA QCCCACCACC AQCTTOGTOC TGAAC3\ACX3A CACCATCGCC 1260 

ACXIATCACXX; CCAAGTOGAA TTCCACCXSCC CTGAACATCA GCCCCGGCGA GAGCTACCXr 1320 

AAGAAQGGCC AGAACX3QCAT CGCXMCACC AGCATOGACG ACITCAACAG CCACCCCATC 1380 

AOCCTGAACA AGAAGCAGGT QGACAAOCTG CTCAACAACA AG0CCATG5AT GCTGGAGACC 1440 

AACCAGAOCG AQGQCX3TCTA CAAGATCAAG GftCACCTACG GCAACATOGT GaOSQGCQGC 1500 

GAGTOGAAOG QOGrTGATCCA GCAGATCAAG QOCAAGACOS CCAGCATCAT OGTCGACGAC 1560 

QGOGAGCGCG TQQCCGAGAA GCG0GTQC3CC GCXIAAQGACT ACX3AGAACCC OGAGGACAAG 1620 

ACXXXXAGCr TGaOCCTGAA GGACX3CXXTG AAGCTGAGCT ACCXXX3ACX3A GATCAAGGAG 1680 

ATCGAQQQCT TCXTECTACTA CAAGAflCAAG COCATCTAOS AGftGCAQOGT GATGAOCTAT 1740 

CTAGACX5AGA ACACXXTCAA QGAQGIGAOC AAQCAQCTGA AOSACACXAC OQGCAAGTTC 1800 

AAGQAjDGTGA GOCACCTOEA CXSAOJIGAAG CTGACXXXX3V AGATCAAOGT (3UXMCAAG 1860 

CTGAGCATCC TGTAOGACAA CX3C0GAGAGC AADGACAACA GCATCGGCAA GTOGACCAAC 1920 

ACCAACATCG TGAGOQGOQG CAACAACX3GC AAGAAQCAGT ACAGCAGCAA CAACCCCGAC 1980 

GCCAACCTGA CCCTGAACAC OGAOGCCCAG GAGAAQCTGA ACAAGAACXX3 CX3ACTACTAC 2040 

ATCAGCXriGT ACATCAAGAG OGAGAAGAAC ACCCAGTIGOG AGATC2CCAT CXSAOSGOGAG 2100 

ATATAOXGA TCACCACXAA GACCX3TGAAC GTCAACAAQG ACAACTACAA QCX3CXTOGAC 2160 

ATCATOGCCC ACAACATCAA GAQCAACCCX: ATCAGCAGCX: TOCACATCAA GACCAACGAC 2220 

GAGATCACXX: TGTTCrQQGA CX3ACATAT0G ATTACOGACG TCX3CX7VGCAT CAAGCCXX3AG 2280 

AACCTGAOOG ACAGCGAGAT CAAGCAGATA TACAGTOQCT AOXXATCAA QCTOGAGGAC 2340 

OXMXXrrGA TCGACAAGAA AQGCX3QCATC CACTAOOQCG AGTTCATCAA OGAGQCCAGC 2400 

TTCAACATCG AGCOCCTQCA GAACTAOGTC ACCAAGTACG AGGTCACCTA CAGCAQCX3AG 2460 

CTGGQCCCCA ACGIGAGOGA CACCCTQGAG AGOGACAAGA TTTACAAGGA OQGCAOCATC 2520 

AAGTTCGACT ICACCAAGTA CAGCAAGAAC GAGCAGGGCC TGTTCTAOGA CAGOQQCCTG 2580 

AACroOGACT TCAAGATCAA OQCCATCACC TPCGPOSCCA AOGAGATGAA OGTOITCCAC 2640 

CXXTACAACA AGTAG 2655 
(2) INPORMATION FOR SBQ ID NO: 27: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LEN3TH: 1389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA" 

(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: ndsc^feature 

(B) LOCATION: 1..1389 

(D) OTHER INPORMATICXa: /note= "maize pptimized EWA 
sequence encoding VIP2A(a) ** 



(xi) SEQUE3^ICE DESCRIFTIGN: SEQ ID NO:27: 



ATGAAGCGCA 


TGGAGGGCAA 


QCTGTTCATG 


GTGAGCAAGA AOCICCAGGT 


GGTGAOCAAG 


60 


ACCX3TGCTQC 


TGAGCAOOGrr 


GTTCAQCATC 


itfSGCTGCTGA ACAACGAGGT 


GATCAAGGCC 


120 


GAGCAGCTGA 


ACATCAACAG 


CCAGAGCAAG 


TACACCAACC TCCAGAACCT 


GAAGATCACC 


180 


GACAAOGTGG 


AGGACTTCAA 


GGAOGACAAG 


GAGAAGGCCA AGGAGTGGOG 


CAAGGAGAAG 


240 


GAGAAGGAGT 


GGAAGCTTAC 


OGGCACOGAG 


AAGGGCAAGA TGAACAACTT 


OCTQGACAAC 


300 


AAGAACQACA 


TCAAGAOCAA 


CTACAAGGAG 


ATCAOCTTCA GCATAQOOQG 


CAQCITOGAG 


360 


GACGAGATCA 


AGGACCTGAA 


GGAGATOGAC 


AAGATGTICG ACAAGACCAA 


OCTGAOCAAC 


420 


AGCATCATCA 


OCTACAAGAA 


CGTOGAOXC 


AOCACCATOG GCTICAACAA 


GAGCCTGACC 


480 


GAGGGCAACA 


CCATCAACAG 


CGAOGOCATG 


GCCCAGTTCA AOGAGCAGTT 


CCTQGACOGC 


540 


GACATCAAGT 


TOGACAGCTA 


OCTQGACACC 


CACCTGACCG CCCAGCAOGT 


GAGCAGCAAG 


600 


GAGCGCGTGA 


TOCTGAAOGT 


GAOOGTOOCC 


AGCGGCAAGG GCAGCAOCAC 


CCCCAOCAAG 


660 


GCOQGOGTGA 


TOCTGAACAA 


CAOOGAGrrAC 


AAGATQCTGA TCXSACAAOQG 


CTACATQGTG 


720 


CAOGTOGACA 


AGGTGAGCAA 


GGIOGTGAAG 


AAGQGCGTGG AGTGCCTCCA 


GATCGAGGGC 


780 


ACCCTGAAGA 


AGAGrrCTAGA 


CTTCAAGAAC 


GACATCAAOG CCGAGGCCCA 


CAGCTQGGGC 


840 


ATGAAGAACT 


AOGAGGAGTG 


GGOCAAOGAC 


CTGAODGACA GCX»QCCXX3A 


GGOOCTGGAC 


900 


GGCTADGOCC 


GCCAGGACTA 


CAAGGAGATC 


AACAACTACC TOOQCAACCA 


GQGCGGCAGC 


960 


QQCAACGAGA 


AGCTGGAOGC 


CCAGATCAAG 


AACATCAGOG AOGCCCTGOG 


CAAGAAGCOC 


1020 
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ATCCCCGAGA ACATCACCGT GTACCX5CTGG TGCGGCATCC CCGAGTTCGG CTACCAGATC 



1080 



AGCGACCCCC TQCCCAQCCT GAAQGACTTC GAQGAQCAGT TCCTCAACAC CATCAAQGAG 1140 



1200 



1389 



GACAAGQQCT ACATCAGCAC CAGCTTGAGC AGCXSWaOQCC TQGCOXCTr CQQCAGCCQC 
AAGATCATCC TQCXSCTTGCA QGTQCCCAAG GGCAGCACTG GTrOCCTACCT GAGCGCCATC 1260 
GGOQGCITCG CTAQCGAGAA GGAGATCCTG CTQGATAAGG ACAGCAAGTA CXIACATCGAC 1320 
AAGGTGACCG AGGTCATCAT CAAGGGCGTG AAGQGCTACX5 TQOTQGACQC CACCCTOCIC 1380 
Aa:AACTAG — 
(2) INFORMATION FOR SBQ ID N0:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENSriH: 2378 base pairs 

(B) TyPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ENA (genomic) 
(iii) HYPaiHBTICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 2375 

(D) OTHER INFORMATION: /note= "Native ENA sequence 
encoding VlP3A(a) protein from AB88 as contained in pCIB7104" 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:28: 

AGATGAAC ATG AAC AAG AAT AAT ACT AAA TTA AGC ACA AGA GCC TTA CCA 50 
Met Asn Lys Asn Asn Hxr Lys Leu Ser Ote Arg Ala Leu Pro 
15 10 

AGT TIT ATT GAT TAT TIT- AAT QQC ATT TAT QGA TTT GCC ACT GGT ATC 98 
Ser Pbe lie Asp Tyr Phe Asn Gly He lyr Gly Ete Ala Thr Gly He 
15 20 25 30 

AAA GAC ATT ATG AAC ATG ATT TTT AAA ACQ GAT ACA GGT GGT GAT CTA 146 
Lys Asp He Met Asn Met He Phe Lys Thr Asp Ihr Gly Gly Asp Leu 
35 40 45 

ACC CTA GAC GAA ATT TTA AAG AAT CAG CAG TTA CTA AAT GAT ATT TCT 194 
Thr Leu Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Asp He Ser 
50 55 60 

GGT AAA TTG GAT GOG GTG AAT QGA AGC TTA AAT GAT CTT ATC GCA CAG 242 
Gly Lys Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin 
65 70 75 
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GGA AAC TVA AAT ACA GAA TFA TCT AAG GAA ATA TTA AAA ATP GCA AAT 290 
Gly Asn Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn 
80 85 90 

GAA CAA AAT CAA GTT TTA AAT GAT GTT AAT AAC AAA CTC GAT GCG ATA 338 
Glu Gin Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He 
95 100 105 110 

AAT AOG ATG CTT CGG GTA TAT CTA CCT AAA ATT ACC TCT ATG TTG ACT 386 
Asn thr Met Leu Arg Val TV^ Leu Pro Lys He Ttor Ser Met Leu Ser 
_ 115 120 125 

GAT GTA ATG AAA CAA AAT TAT GOG CTA AGT CTG CAA ATA GAA TAC TTA 434 
Asp Val Met Lys Gin Asn Ty^ Ala Leu Ser Leu Gin He Glu lyr Leu 
130 135 140 

AGT AAA CAA TTG CAA GAG ATT TCT GAT AAG TTG GAT ATT ATT AAT GTA 482 
Ser Lys Gin Leu Gin Glu He Ser Asp Lys Leu Asp He He Asn Val 
145 150 155 

AAT GTA CTT ATT AAC TCT ACA CTT ACT GAA ATT ACA CCT GOG TAT CAA 530 
Asn Val Leu He Asn Ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin 
160 165 170 

AGG ATT AAA TAT GTG AAC GAA AAA TIT GAG GAA TTA ACT TIT GCT ACA 578 
Arg He Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr 
175 180 185 190 

GAA ACT AGT TCA AAA GTA AAA AAG GAT GGC TCT CCT GCA GAT ATT CTT 626 
Glu Thr Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp He Leu 
195 200 205 

GAT GAG TTA ACT GAG TTA ACT GAA CTA GCG AAA AGT GTA ACA AAA AAT 674 
Asp Glu Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Ihr Lys Asn 
210 215 220 

GAT GTG GAT GGTT TTT GPA TTT TAC CTT AAT ACA TTC CAC GAT GTA ATG 722 
Asp Val Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met 
225 230 235 

GTA GGA AAT AAT TTA TTC GGG CGT TCA GCT TTA AAA ACT GCA TOG GAA 770 
Val Gly Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Ihr Ala Ser Glu 
240 245 250 

TTA ATT ACT AAA GAA AAT GTG AAA ACA AGT GGC ACT GAG GTC GGA AAT 818 
Leu He Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn 
255 260 265 270 

GTT TAT AAC TIC TTA ATT GTA TTA ACA OCT CTG CAA GCC CAA GCT TIT 866 
Val TVr Asn Phe Leu He Val Leu Thr Ala Leu Gin Ala Gin Ala Ehe 
275 280 285 

CTT ACT TTA ACA ACA TOO CGA AAA TTA TTA GGC TTA GCA GAT ATT GAT 914 
Leu Thr Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp He Asp 
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290 295 300 

TAT ACT TCT ATV ATG AAT GAA CAT TTA AAT AAG GAA AAA GAG GAA TIT 962 
Tyr Thr Ser lie Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe 
305 310 315 



AGA GTA AAC ATC CTC CCT ACA CTT TCT AAT ACT TIT TCT AAT CCT AAT 
Arg Val Asn lie Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn 
320 325 330 



1010 



TAT GCA AAA GTT AAA GGA AGT GAT GAA GAT ^PU^ ATG ATT GIX5 GAA 1058 
Tyr Aitt-liys Val Lys Gly Ser Asp Glu Asp Ala Lys Met He Val Glu 
335 340 345 ^ 350 

GOT AAA CCA GGA CAT GCA TIG ATT QQG TTT GAA ATT ACT AAT GAT TCA 1106 
Ala Lys Pro Gly His Ala Leu He Gly Hie Glu He Ser Asn Asp Ser 
355 360 365 

ATT ACA GTA TTA AAA GTA TAT GAG GCT AAG CTA AAA CAA AAT TAT CAA 1154 
He Ohr Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin 
370 375 380 

GTC GAT AAG GAT TOC TTA TCG GAA GTT ATT TAT QGT GAT ATC GAT AAA 1202 
Val Asp Lys Asp Ser Leu Ser Glu Val He Tyr Gly Asp Met Asp Lys 
385 390 395 

TTA TIG TGC CCA GAT CAA TCT GAA CAA ATC TAT TAT ACA AAT AAC ATA 1250 
Leu Leu Cys Pro Asp Gin Ser Glu Gin He Tyr Tyr Thr Asn Asn He 
400 405 410 

GTA TIT CCA AAT GAA TAT GTA ATT ACT AAA ATT GAT TTC ACT AAA AAA 1298 
Val Phe Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys 
415 420 425 430 

ATC AAA ACT TTA AGA TAT GAG GTA ACA GCG AAT TTT TAT GAT TCV TCT 1346 
Met Lys Thr Leu Arg Tyr Glu Val Ihr Ala Asn Phe lyr Asp Ser Ser 
435 . 440 445 

ACA GGA GAA ATT GAC TTA AAT AAG AAA AAA CTA GAA TCA AGT GAA GOG 1394 
Thr Gly Glu He A^ Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala 
450— 455 460 

GAG TAT AGA ACG TTA ACT GCT AAT GAT GAT GOG GTC TAT ATC CCG TTA 1442 
Glu Tyr Arg Thr Leu Ser Ala Asn Asp Asp Gly Val IVr Met Pro Leu 
465 470 475 

GCT GTC ATC ACT GAA ACA TTT TTC ACT COG ATT AAT QGG TTT GGC CTC 1490 
Gly Val He Ser Glu Ohr Phe Leu Thr Pro He Asn Gly Phe Gly Leu 
480 485 490 

CAA GCT GAT GAA AAT TCA AGA TTA ATT ACT TTA ACA TCT AAA TCA TAT 1538 
Gin Ala Asp Glu Asn Ser Arg Leu He Ohr Leu Thr Cys Lys Ser Tyr 
495 500 505 5IO 

TTA AGA GAA CTA CTC CTA GCA ACA GAC TTA AGC AAT AAA GAA ACT AAA 1586 
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Leu Arg Glu Leu L<eu Leu Ala Thr Asp Leu Ser Asn Lys Glu Ttir Lys 
515 520 525 

TIG ATC GTC CCG CX:A ACT GGT TTT ATT AGC AAT ATT GTTA GAG AAC QGG 1634 
Leu lie Val Pro Pro Ser Gly Phe He Ser Asn He Val Glu Asn Gly 
530 535 540 

TCC ATA GAA GAG GAC AAT TTA GAG CX:X3 TOG AAA OCA AAT AAT AAG AAT 1682 
Ser He Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn 
545 550 555 

GCG TAT GTA GAT C^ACA GQC GGA GTG AAT QGA ACT AAA GCT TTA TAT T730 
Ala Tyr Val Asp His Thr Gly Gly Val Asn Gly-Uir Lys Ala Leu Tyr 
560 565 570 

GTIT CAT AAG GAC GGA GGA ATT TCA CAA TIT ATT GGA GAT AAG TTA AAA 1778 
Val His Lys Asp Gly Gly He Ser Gin Phe He Gly Asp Lys Leu Lys 
575 580 585 590 

CCG AAA ACT GAG TAT GTA ATC CAA TAT ACT GTT AAA GGA AAA CCT TCT 1826 
Pro Lys Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser 
595 600 605 

ATT CAT TTA AAA GAT GAA AAT ACT QGA TAT ATT CAT TAT GAA GAT ACA 1874 
He His Leu Lys Asp Glu Asn TSir Gly lyr He His Tyr Glu Asp Thr 
610 615 620 

AAT AAT AAT TEA GAA GAT TAT CAA ACT ATT AAT AAA OCT TTT ACT ACA 1922 
Asn Asn Asn Leu Glu Asp Tyr Gin Thr He Asn Lys Arg Phe Thr Thr 
625 630 635 

GGA ACT GAT TTA AAG QGA GTG TAT TTA ATT TTA AAA AGT CAA AAT GGA 1970 
Gly Thr Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser Gin Asn Gly 
640 645 650 

GAT GAA OCT TSG GGA GAT AAC TIT ATT ATT TTG GAA ATT AGT CCT TCT 2018 
Asp Glu Ala Trp Gly Asp Asn Phe He He Leu Glu He Ser Pro Ser 
655 660 665 670 

GAA AAG TTA TTA AGT CCA GAA TTA ATT AAT ACA AAT AAT TOG ACG AGT 2066 
6ttt Lys Leu Leu Ser Pro Glu Leu He Asn Thr Asn Asn Trp Thr Ser 
675 680 685 

AOG GGA TCA ACT AAT ATT AGC GGT AAT ACA CTC ACT CTT TAT GAG QGA 2114 
Ihr Gly Ser Thr Asn He Ser Gly Asn Thr Leu Thr Leu Tyr Gin Gly 

690 695 700 

GGA OGA GGG ATT CTA AAA CAA AAC CTT CAA TEA GAT AGT TTT TCA ACT 2162 
Gly Arg Gly He Leu Lys Gin Asn Leu Gin Leu Asp Ser Phe Ser Thr 
705 710 715 

TAT AGA GTC TAT TTT TCP GTG TCC GGA GAT GCT AAT GTA AGG ATT AGA 2210 
Tyr Arg Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg He Arg 
720 725 730 
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AAT TCT AGG GAA GTG TTA TTT GAA AAA AGA TAT ATG AGC GGT GCT AAA 2258 
Asn Ser Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys 
735 740 745 750 

GAT GIT TCT GAA ATG TTC ACT ACA AAA TTT GAG AAA GAT AAC TIT TAT 2306 
Asp Val Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr 
755 760 765 

ATA GAG CTT TCT CAA GGG AAT AAT TTA TAT GGT GGT CCT ATT GTA CAT 2354 
He Glu Leu Ser Gin Gly Asn Asn Leu Tyr Gly Gly Pro He Val His 
— 770 775 780 

TTT TAG GAT GTC TCT ATT AAG TAA 2378 
Phe Tyr Asp Val Ser He Lys 
785 



(2) INFXDRMATION FOR SBQ ID ND:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE2I3TH: 789 amino acids 

(B) TVTPE: amino acid 
(D) TOPOLOGy: linear 

(ii) MOLECULE TYPE: protein 

(xi) SBQUEWCE DESCRIPTION: SEQ ID NO:29: 

Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Ehe 
15 10 15 

He Asp Tyr Kie Asn Gly He Tyr Gly Rae Ala Thr Gly He Lys Asp 
20 25 30 

He Met Asn Met He Phe Lys Thr A^ Thr Gly Gly Asp Leu Thr Leu 
35 40 45 

Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Asp He Ser Gly Lys 
50 55 60 

Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 
65 70 75 80 

Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 

Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Thr 
100 105 110 

Met Leu Arg Val TV^ Leu Pro Lys He Thr Ser Met Leu Ser Asp Val 
115 120 125 

Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Leu Ser Lys 
130 135 140 
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Gin Leu Gin Glu He Ser Asp Lys Leu Asp He He Asn Val Asn Val 
145 150 155 160 

Leu lie Asn Ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin Arg He 
165 170 175 

Lys ayr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr 
180 185 190 

Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp He Leu Asp Glu 
195 200 205 

Leu Thr GlTTI^u Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 

210 215 220 

Asp Gly Phe Glu Phe TVr Leu Asn Thr Phe His Asp Val Met Val Gly 
225 230 235 240 

Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
245 250 255 

Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val TVr 
260 265 270 

Asn Phe Leu He Val Leu Thr Ala Leu Gin Ala Gin Ala Phe Leu Thr 
275 280 285 

Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp He Asp TVr Thr 
290 295 300 

Ser He Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Am Val 
305 310 315 320 

Asn He Leu Pro Thr Leu Ser Asn Thr Kie Ser Asn Pro Asn Tyr Ala 
325 330 335 

Lys Val Lys Gly Ser A^p Glu Asp Ala Lys Met He Val Glu Ala Lys 
340 345 350 

Pro Gly His Ala Leu He Gly Phe Glu He Ser Asn Asp Ser He Thr 
355 360 365 

Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin Val Asd 
370 375 380 

I 

Lys Asp Ser Leu Ser Glu Val He Tyr Gly Asp Met Asp Lys Leu Leu 
385 390 395 400 

Cys Pro Asp Gin Ser Glu Gin He Tyr Tyr Thr Asn Asn He Val Phe 
405 410 415 

Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys Met Lys 
420 425 430 

Thr l^u Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
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435 



440 



445 



Glu lie Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr 
450 455 460 

Arg Ihr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
465 470 475 480 

lie Ser Glu Ihr Phe Leu Thr Pro He Asn Gly Phe Gly L«j Gin Ala 
485 490 495 

Asp Glu Asn Ser Arg Leu He Ihr Leu Ihr Cys Lys Ser Tyr Leu Arg 
500 505 510 — 

Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Ihr Lys Leu He 
515 520 525 

Val Pro Pro Ser Gly Phe He Ser Asn He Val Glu Asn Gly Ser He 
530 535 540 

Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala *iyr 
545 550 555 560 

Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His 
565 570 575 

Lys A^ Gly Gly He Ser Gin Pbe lie Gly Asp Lys Leu Lys Pro Lys 
580 585 590 

Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser He His 
595 600 605 

Leu Lys Asp Glu Asn Ihr Gly Tyr He His Tyr Glu Asp Thr Asn Asn 
610 615 620 

Asn Leu Glu Asp Tyr Gin Thr He Asn Lys Arg £he Ihr Thr Gly ihr 
625 630 635 640 

Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser Gin Asn Gly Asp Glu 
645 650 655 

Ala Trp Gly Asp Asn Phe He He Leu Glu He Ser Pro Ser Glu Lys 
660 665 670 

Leu Leu Ser Pro Glu Leu He Asn Thr Asn Asn Trp Thr Ser Ihr Gly 
675 680 685 

Ser Thr Asn He Ser Gly Asn Thr Leu Thr Leu Tyr Gin Gly Gly Arg 
690 695 700 

Gly He Leu Lys Gin Asn Leu Gin Leu Asp Ser Phe Ser Thr Tyr Arg 
705 710 715 720 



Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg He Arg Asn Ser 
725 730 735 
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Pxg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val 
740 745 750 

Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe IVr He Glu 
755 760 765 

Leu Ser Gin Gly Asn Asn Leu Tyr Gly Gly Pro He Val His Phe Tyr 
770 775 780 

Asp Val Ser He Lys 
785 

(2) INFORMATION FOR SBQ ID NO:30: 

(i) SBQUQCE CHARACTERISTICS: 

(A) LENGTH: 2403 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single. 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic ENA" 

(iii) HYPOTOETICAL: MO 



(ix) FEATURE: 

(A) NAME/KEY: raisc_feature 

(B) LOCATION: 11.. 2389 

(D) OTHER INPOPMPnuW: /note= "maize optimized ENA 
sequence encoding VIP3A{a) " 



(xi) SBQUQ^.DESCRIFTICN: SBQ ID NO: 30: 

GGATOCACCA ATGAACATGA ACAAGAACAA CACCAAOCTG AGCAOOOOCG CCCTGCCGAG 60 

CTTCATCGAC TACTTCAAOG QCATCTACQG CTTOGCCAOC QQCATCAAGG ACATCATC3AA 120 

CASTCATCTTC AAGACOCaOl COGOaSQOa 180 

GCAi3CTGCTG AACGACATCA GOQGGAAGCT GGAOQGCGrTG AAOQGCAGCC TGAAOGACCT 240 

GATOQCCCAG GGCAAOCTGA ACAD0C5AGCT GAGCAAGGAG ATOCTTAAGA TOGCCAACGA 300 

OCAGAADCAG GTGCTGAACG ACG?PGAACAA CAAQCTQGAC QCCATCAACA CCATOCTGOG 360 

OGTOEACCTG CCGAAGATCA OCAGCATGCT GAGCGAOSTG ATGAAGCAGA ACTACGCCCT 420 

GAGCCTQCAG ATCXSACSTTACC TGAGCAAQCA GCTOCAGGAG ATCAGCGACA AOCTQGACAT 480 

CATCAACGTG AACGTCCTGA TGAACAGCAC CCTGACOGAG ATCAOCCCGG CCTACCAGQG 540 

CATCAAGTAC GTGAACGAGA ACTTOGAAGA GCTGACCTTC GCCACCGAGA CCAGCAGCAA 600 
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QGTC3AAGAAG GACX3QCAGCC CQQCOGACAT CCTOGAOGAG CTGACCGAGC TGACCGAQCT 660 

QCXCAAGAGC GTCAiXAAGA ACGACCTX3GA OQQCTrcGAG TTCTACCTGA ACACCITCCA 720 

CX3ACGTCATG GTGGGCAACA ACCTGTTOGG CXX3CAGCGCC CTGAAGACCG CCAGCX3AGCT 780 

GATCACCAAG GAGAADG?IGA AGACXIAGCXSG CMSOGPiGGTG QQCAACGTCT ACAACTICCT 840 

GATOGTOCTG PJXGCCCTCC AQOOOCAOSC CnTCCTGACX: CTGACXAOCT GTOGCAAQCT 900 

GCTQQGCX7K5 GCCXSACATOG-^TACAa^ CATCATGAAC GAQCSCTTCA ACAMGAGAA 960 

GGAGGAGITC CGCGTGAACA TCCTQ0CC3AC CCIGAGCAAC AOCTICAGCA ACCCGAACTA 1020 

CGCCAAGGTG AAGGGCAG0C3 ACGAQGAOQC CAAGATCATC GTQGAGGCTA AGCaXSOCCA 1080 

aSCGriTGATC GGCnCGAGA TCAGCAADGA CAGCATCACC GTCCTGAftGG TCTACGflGQC 1140 

CAAGCTGAAG CAGAACTACC AQGTQQACAA QQACAGCTTG AQC3GAGGTOA TCTAOSQCXSA 1200 

CMQGACAAG CTGCTffPGTC OQGAOCAGAG 0C3MCAAATC TACTftCACCA ACAACATOST 1260 

GTOXCGAAC GAGTAOGTGA TCAOCAAGAT OCSAdTCaCC AAGAAGATGA AGftCXCTOC G 1320 

CTACGAQGTC ADCXXXAACT TCTAOGACAG CMCACOQGC GAGATOGACC TGAACAAGAA 1380 

GAAGGTCGAG AGCAQCGAQG CCGAGTAOCG CAOXTCAQC GCGAAOSACXS AOGGCXnCTA 1440 

CATOXACTG GQCX3TGATCA GCGAGACCIT CCVGP€CQCG ATCAACX30CT TTOGCCTOCA 1500 

CSGCOGAOGftG AACAGCXX3CC TGATCMXXT GACCTCrTAAG AQCTACCTQC GCXaOCTOCT 1560 

GCTAGCCADC GACCTCAGCA ACAAGGaCAC QUVSOTGATC GTGCXTtfXGA QCX3GCTTCAT 1620 

CAGCAACATC GrTGGAGAAOS QCAQCATOGA QGAOGACAAC CTOGAGCOGT C3GAAGGCCAA 1680 

CAACAAGAAC GCCTAOGTOG ACCACA0CX3G COQOCTGAAC GGCACCAAGG CGCTCTAOSrr 1740 

GCACAAQGAC QQCX3QCATCA QCXZAGTICAT CX3Qa3ACAAG CTGAAGCCGA AGAOCGAGTA 1800 

OSTCATOCAG TACACOGTCA AQGQCAAGCC ATOGATTCAC CTGAAGGAOG AGAACAOOQG 1860 

CTACATCCAC TACXSAGGACA CX:AACAACAA CCTOGAQGAC TADCAGAOCA TCAACAAGCC 1920 

CITCACCACC GQCACCGACC TGAAQQOCGT GTAOCTOATC CTQAAGAGCC AGAACX3QCGA 1980 

CGAOGOCTGG QQCGACAACT TCATCATCCT QGAGATCAGC CXX3AQCGAGA AQCTOCTGAG 2040 

CCXX3GAGCTG ATCAACACCA ACAACTQGAC CAGCACOQQC AGCACCAACA TCAQCXSGCAA 2100 

CACOCTGACC CIXnACCAGG GOGQCXX3CX3G OVTCCTGAAG CAGAACCTQC AGCTOGACAG 2160 

CTICAGCACC TAOxxxsrrGrr AcrrcAocxTr gagoqgcxsac gcctvaogtqc gcatcoqcaa 2220 
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CAGCXXSOGAG GTGCICTTCG AGAAGAGCTTA CATGAGCGQC GCCAAQGACG TCAGCGAGAT 2280 
GTTCACCACC AAGTTOGAGA AGGACAACTT CTACATCGAG CTGAGCCAQG GCAACAACCT 2340 
GTAOQQOGGC C0GATC3GTQC ACOTCTAOSA OGTGAGCATC AAGTTAAOGT AGAGCTCAGA 2400 
TCT 2403 
(2) INFORMATION FOR SEQ JD N0:31: 

(i) SEQUENCE OlARACTERICTICS ; 

(A) LEN3IU:-2€±2 base pairs 

(B) TOPE: nucleic acid 

(C) STRANI^XNESS: single 

(D) TOPOLOGVT: linear 

(ii) MDLBCXJLE TVPE: CNA (genondc) 
(iii) HYPOTHETICAL: ND 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 118. ,2484 

(D) aiWEK INFORMATION: /note= "Native CNA sequence 
encoding VIP3A(b) from AB424" 



(xi) SEQUENCE raSCRIPriCN: SBQ ID NO: 31: 
ATTGAAATTG ATAAAAAGTT ATGAGTGTTT AATAATCAGTT AATTACCAAT AAAGAATTAA 60 

GAATACAAGT TTACAAGAAA TAAGTGTTAC AAAAAATAGC TGAAAAGGAA GATGAAC 117 

ATG AAC AAG AAT AAT ACT AAA TPA AGC ACA AGA GCX: TTA.OCA ACT TTT 165 
Met Asn Lys Asn Asn Ihr Lys Leu Ser Thr Axg Ala Leu Pro Ser Phe 
790 795 800 805 

ATT GAT TAT TTC AAT GGC ATT TAT GGA TIT GCX: ACT GGT ATC AAA GAC 213 
He Asp IVr Phe Asn Gly He Tyr Gly Phe Ala Thr Gly He Lys Asp 
810 815 820 

ATT ATG AAC ATG ATT TTT AAA ACG GAT ACA GGT GGT GAT CPA ACC CTA 261 
He Met Asn Met He Phe Lys Hhr Asp Thr Gly Gly Asp Leu Tbr Leu 
825 830 835 

GAC GAA ATT TTA AAG AAT GAG CAG CTA CTA AAT GAT ATT TCT GGT AAA 309 
Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Asp He Ser Gly Lys 
840 845 850 

TTO GAT GGG GTG AAT GGA AGC TTA AAT GAT CTT ATC OCA CAG GGA AAC 357 
Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 

855 860 865 

TTA AAT ACA GAA TTA TCT AAG GAA ATA TTA AAA ATT GCA AAT GAA CAA 405 
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Leu Asn Thr Glu Leu Sex- Lys Glu He Leu Lys He Ala Asn Glu Gin 
870 875 880 885 

MT CAA GTT TTA AAT GAT GTT AAT AAC AAA Cnx: GAT GOT ATA AAT ACG 453 
Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Thr 
890 895 900 

ATG CIT CXX5 GTA TAT CTA CCT AAA ATT ACC TCT ATG TTC GAT GTA 501 
Met Leu Arg Val T/r Leu Pro Lys He Ttir Ser Met Leu Ser Asp Val 
905 910 915 

J^TCTTffiA CAA AAT TAT GCG CTA_AGT CTG CAA ATA GAA TAC TTA ACT AAA 549 
Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Lea Ser Lys 
920 925 930 

CAA TIG CAA GAG ATT TCT GAT AAG TTG GAT ATT ATT AAT GTA AAT CTA 597 
Gin Leu Gin Glu He Ser Asp Lys Leu Asp He He Asn Val Asn Val 
935 940 945 

CTT ATT AAC TCT ACA CTT ACT GAA ATT ACA CGT GOG TAT CAA AGG ATT 645 
Leu He Asn Ser Thr Leu Tbr Glu He Thr Pro Ala Tyr Gin Arg He 
950 955 960 965 

AAA TAT GTG AAC GAA AAA TTT GAG GAA TTA ACT TTT OCT ACA GAA ACT 693 
Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Ohr Glu Thr 
970 975 980 

ACT TCA AAA CTA AAA AAG GAT GGC TCT CCT GCA GAT ATT CCT GAT GAG 741 
Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp He Arg Asp Glu 
985 990 995 

TTA ACT GAG TTA ACT GAA CTA GCG AAA ACT CTA ACA AAA AAT GAT GIG 789 
Leu Ihr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
1000 1005 1010 

GAT GCT TTT GAA TTT TAC CTT AAT ACA TIC CAC GAT CTA ATC CTA GGA 837 
Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly 
1015 1020 1025 

AAT AAT TTA TTC GOG CXH*. TCA GCT TTA AAA ACT GCA TOG GAA TTA ATT 885 
Asn Asn Leu-%e Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
1030 1035 1040 1045 

ACT AAA GAA AAT GTG AAA ACA ACT GGC ACT GAG GTC QGA AAT GTT TAT 933 
Thr Lys Glu Asn Val Lys Ihr Ser Gly Ser Glu Val Gly Asn Val Tyr 
1050 1055 1060 

AAC TIC CTA ATT CTA TTA ACA GCT CTG CAA OCA AAA GCT TIT CTT ACT 981 
Asn E^e Leu He Val Leu Thr Ala Leu Gin Ala Lys Ala Phe Leu Thr 
1065 1070 1075 

TTA ACA CCA TGC OGA AAA TTA TTA GGC TTA GCA GAT ATT GAT TAT ACT 1029 
Leu Thr Pro Cys Arg Lys Leu Leu Gly Leu Ala Asp He Asp Tyr Thr 
1080 1085 1090 
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TCT ATI' ATG AAT GAA CAT TTA AAT AAG GAA AAA GAG GAA TTT AGA GTA 1077 
Ser lie Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val 
1095 1100 1105 

AAC ATC CTC CCT ACA CTT TCT AAT ACT TTT TCT AAT CCT AAT TAT CCA 1125 
Asn He Leu Pro Ihr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
1110 1115 1120 1125 

AAA GIT AAA QGA AGT GAT GAA GAT GCA AAG ATC ATT GTG GAA OCT AAA 1173 
Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met He Val Glu Ala Lys 
1130 1135 1140 

CCA GGflTcAT GCA TTG ATT GOG TTT -GAA ATT AGT AAT GAT TCA ATT ACA 1221 
Pro Gly His Ala Leu He Gly Ptie Glu He Ser Asn Asp Ser He Hir 
1145 H50 1155 

GTA TTA AAA GTA TAT GAG GCT AAG CTA AAA CAA AAT TAT CAA GTC GAT 1269 
Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin Val Asp 
1160 1165 1170 



AAG GAT TCC TTA TCG GAA GTT ATT TAT GGC GAT ATG GAT AAA TTA TIG 1317 
Lys Asp Ser Ser Glu Val He Tyr Gly Asp Met Asp Lys Leu Leu 
1175 1180 1185 

TGC CCA GAT CAA TCT QGA CAA ATC TAT TAT ACA AAT AAC ATA GTA TIT 1365 
cys Pro Asp Gin Ser Gly Gin He Tyr Tyr Thr Asn Asn He Val Phe 
1190 1195 1200 1205 

CCA AAT GAA TAT GTA ATT ACT AAA ATT GAT TTC ACT AAA AAA ATG AAA 1413 
Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys Met Lys 
1210 1215 1220 

ACT TTA AGA TAT GAG GTA ACA GOG AAT TIT TAT GAT TCT TCT ACA QGA 1461 
Thr Leu Arg Tyr Glu Val Tiir Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
1225 1230 1235 

GAA ATP GAC TTA AAT AAG AAA AAA GTA GAA TCA AGT GAA GCG GAG TAT 1509 
Glu He Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr 
1240 1245 1250 

AGA ACG TTA AGT GCT AAT GAT GAT GOG GTG TAT ATG COG TTA GGT GTC 1557 
Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
1255 1260 1265 

ATC AGT GAA ACA TTT TIG ACT COG ATT AAT GGG TTT GGC CTC CAA GCT 1605 
He Ser Glu Thr Phe Leu Thr Pro He Asn Gly Phe Gly Leu Gin Ala 
1270 1275 1280 1285 

GAT GAA AAT TCA AGA TIA ATT ACT TTA ACA TGT AAA TCA TAT TTA A^ 1653 
Asp Glu Asn Ser Arg Leu He *fllir Leu Thr Cys Lys Ser Tyr Leu Arg 
1290 1295 1300 

GAA CTA CTG CTA GCA ACA GAC TTA AGC AAT AAA GAA ACT AAA TIG ATC 1701 
Glu L^u Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu He 
1305 1310 1315 
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GTC COG CCA ACT GCT TTT ATT AGC AAT ATT CTA GAG AAC GGG TCC ATA 1749 
Val Pro Pro Ser Gly Phe lie Ser Asn lie Val Glu Asn Gly Ser He 
1320 1325 1330 

GAA GAG GAC AAT TTA GAG CXG TOG AAA GCA AAT AAT AAG AAT GCG TAT 1797 
Glu Glu Asp Asn Leu Glu Pro Ttp Lys Ala Asn Asn Lys Asn Ala Tyr 
1335 1340 1345 

GTA GAT CAT ACA QQC GGA GTG AAT QQA ACT AAA GCT TTA TAT GIT CAT 1845 
Val Asp His Ihr Gly Qly-Val Asn Gly Ihr Lys Ala Leu Tyr Val His- 
1350 > 1355 1360 1365 

AAG GAC GGA QGA ATT TCA CAA TTT ATT GGA GAT AAG TTA AAA CCG AAA 1893 
Lys Asp Gly Gly He Ser Gin Phe He Gly Asp Lys Leu Lys Pro Lys 
1370 1375 1380 

ACT GAG TAT CTA ATC CAA TAT ACT GTT AAA GGA AAA OCT TCT ATT CAT 1941 
Ihr Glu Tyr Val He Gin Tyr Ihr Val Lys Gly Lys Pro Ser He His 
1385 1390 1395 

TTA AAA GAT GAA AAT ACT GGA TAT ATT CAT TAT GAA GAT ACA AAT AAT 1989 
Leu Lys Asp Glu Asn Thr Gly oyr He His Tyr Glu Asp Thr Asn Asn 
1400 1405 1410 

AAT TTA GAA GAT TAT CAA ACT ATT AAT AAA OCT TTT ACT ACA GGA ACT 2037 
Asn Leu Glu Asp lyr Gin Thr He Asn Lys Arg Phe Thr Thr Gly Thr 
1415 1420 1425 

GAT TTA AAG GGA GTG TAT TTA ATT TTA AAA ACT CAA AAT QGA GAT GAA 2085 
Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser Gin Asn Gly Asp Glu 
1430 1435 1440 1445 

GCT TGG GGA GAT AAC TTT ATT ATT TTG GAA ATT ACT C3CT TCT GAA AAG 2133 
Ala Trp Gly Asp Asn Phe He He Leu Glu He Ser Pro Ser Glu Lys 
1450 1455 1460 

TTA TTA ACT OCA GAA TTA ATT AAT ACA AAT AAT TOG AOG ACT AOG QGA 2181 
Leu Ser Pro Glu Leu He Asn Thr Asn Asn Trp Thr Ser Thr Gly 
1465 1470 1475 

TCA ACT AAT ATT AGC GCT AAT ACA CTC ACT CTT TAT CAG GGA QGA CGA 2229 
Ser Thr Asn He Ser Gly Asn Thr Leu Thr Leu Tyr Gin Gly Gly Arg 
1480 1485 1490 

GGG ATT CTA AAA CAA AAC CTT CAA TTA GAT ACT TTT TCA ACT TAT AGA 2277 
Gly He Leu Lys Gin Asn Leu Gin Leu Asp Ser E^e Ser Thr Tyr Arg 
1495 1500 1505 

GTG TAT TTC TCT GTG TCC GGA GAT GCT AAT CTA AGG ATT AGA AAT TCT 2325 
Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg He Arg Asn Ser 
1510 1515 1520 1525 

AGG GAA GTG TTA TTT GAA AAA AGA TAT ATG AGC GCT GCT AAA GAT GTT 2373 
Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val 
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1530 1535 1540 

TCT GAA ATG TTC ACT ACA AAA TIT GAG AAA GAT AAC TTC TAT ATA GAG 2421 
Ser Glu Met Etie Thr Htxr Lys Phe Glu Lys Asp Asn Phe Tyr lie Glu 
1545 1550 1555 

err TCT CAA GGG AAT AAT TTA TAT GGT QGT OCT ATT GTA CAT TIT TAG 2469 
Leu Ser Gin Gly Asn Asn Leu Tyr Gly Gly Pro lie Val His Phe Tyr 
1560 1565 1570 

GKV GTC TCT ATT AAG TAAGATOQQG ATCTAATATT AACAGTTTIT AGAAQCTAAT 2524 
Asp Val Ser He Lys 
1575 _ 

TCTTGTATAA TGTCdTGAT TATQGAAAAA CACAATTTTG TITGCrAAGA TGTATATATA 2584 
GCICACTCAT TAAAAQGCAA TCAAQCTT 2612 

(2) INFORMATION TOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE^XSriH: 789 amino acids 

(B) TVTPE: andno acid 
(D) TOPOLOGY: linear 

(ii) MOTiFTTJLS TYPE: protein 

(xi) SEQUENCE DESCRIPTICN: SEQ ID NO:32: 

Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Phe 
1 5 10 15 

He Asp Tyr Phe Asn Gly He TVr Gly Phe Ala thr Gly He Lys Asp 
20 25 30 

He *tet Asn Met He Phe Lys Thr Asp Ihr Gly Gly A^p Leu Thr Leu 
35 40 45 

Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Asp He Ser Gly Lys 
50 55 60 

Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 
65 70 75 80 

Leu Asn Ihr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 

Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Ihr 
100 105 110 

Met Leu Arg Val IVr Leu Pro Lys He Ihr Ser Met Leu Ser Asp Val 
115 120 125 

Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Leu Ser Lys 



wo 97/46105 



PCT/EP97/02737 



- 102 - 



130 135 140 

Gin Leu Gin Glu lie Ser Asp Lys Leu Asp He lie Asn Val Asn Val 
145 150 155 

Leu He Asn Ser Thr Leu Thr Glu He Ihr Pro Ala Tyr Gin Arg He 
165 170 175 

Lys Tyr Val Asn Glu Lys E*ie Glu Glu Leu Thr Phe Ala Thr Glu Thr 
180 185 190 

Ser Ser Lys Val Lys Lys Asp Gly Serjto Ala Asp-fle Arg Asp Glu 
19S 200 205 

Leu Thr Glu Leu 'Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
210 215 220 

Asp Gly Phe Glu Phe IVr Leu Asn Thr Phe His Asp Val Met Val Gly 
225 230 235 240 

Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
245 250 255 

Ihr Lys Glu Asn Val Lys Ihr Ser Gly Ser Glu Val Gly Asn Val Tyr 
260 265 270 

Asn Phe Leu He Val Leu rhr Ala Leu Gin Ala Lys Ala Phe Leu Thr 
275 280 285 

Leu liir Pro Cys Arg Lys Leu Leu Gly Leu Ala Asp He Asp Tyr Thr 
290 295 300 

Ser He Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val 
305 310 315 320 

Asn He Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
325 330 335 

Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met He Val Glu Ala Lys 
340 345 350 

Pro Gly His Ala Leu He Gly Phe Glu He Ser Asn Asp Ser He Thr 
355 360 . 365 

Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin Val Asp 
370 375 380 

Lys Asp Ser teu Ser Glu Val He Tyr Gly Asp Met Asp Lys Leu Leu 
385 390 395 400 

Cys Pro Asp Gin Ser Gly Gin He Tyr lyr Thr Asn Asn He Val Phe 
405 410 415 

Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys Met Lys 
420 425 430 
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Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
435 440 445 

Glu lie Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr 
450 455 460 

Arg Ihr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
465 470 475 480 

He Ser Glu Thr Phe Leu Thr Pro He Asn Gly Phe Gly Leu Gin Ala 
485 490 495 ^ 

Asp Glu Asn Ser Arg Leu He Thr Leu Thr Cys Lys Ser Tyr Leu Arg 
500 505 . 510 

Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu He 
515 520 525 

Val Pro Pro Ser Gly Phe He Ser Asn He Val Glu Asn Gly Ser He 
530 535 540 

Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr 
545 550 555 560 

Val Asp His Thr Gly Gly Val Asn Gly Ihr Lys Ala Leu Tyr Val His 
565 570 575 

Lys Asp Gly Gly He Ser Gin Phe He Gly Asp Lys Leu Lys Pro Lys 
580 585 590 

Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser He His 
595 600 605 

Leu Lys Asp Glu Asn Thr Gly Tyr He His Tyr Glu Asp Thr Asn Asn 
610 615 620 

Asn Leu Glu Asp Tyr Gin Thr He Asn Lys Arg Phe Thr Thr Gly Ihr 
625 630 635 640 

Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser Gin Asn Gly Asp Glu 
645 650 655 

Ala Trp Gly Asp Asn ttie He He Leu Glu He Ser Pro Ser Glu Lys 
660 665 670 

Leu Leu Ser Pro Glu Leu He Asn Ihr Asn Asn Trp Thr Ser Thr Gly 

675 680 685 

Ser Thr Asn He Ser Gly Asn Ihr Leu Thr Leu Tyr Gin Gly Gly Arg 
690 695 700 



Gly He Leu Lys Gin Asn Leu Gin Leu Asp Ser Phe Ser Thr Tyr Arg 
705 710 715 720 
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Val 'Vyr Phe Ser Val Ser Giy Asp Ala Asn Val Arg He Arg Asn Ser 
725 730 735 

Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val 
740 745 750 

Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr He Glu 
755 760 765 

Leu Ser Gin Gly Asn Asn Leu Tyr Gly Gly Pro He Val His Phe TVr 
770 775 780 

Asp Val Ser lie Lys 
785 

(2) INFORMATION FOR SBQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEWGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TVPE: other nucleic acid 

(A) DESCRIPTIGN: /desc = "forward primer \ised to make 

PCIB5526'* 

(iii) HYPOriHBriCAL: NO 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 33: 
GGATCCACCA TGAAGACCAA OCAGATCAGC 3q 
(2) INFORMATION FOR SBQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENoTIH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) EffiSCRIPTICN: /desc = "reverse primer used to make 
PCIB5526" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
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AAGCTTCAGC TCCTT 

(2) INFX)RMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARftCTERISTICS : 

(A) LENGIH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STE^ANDEIXJESS : single 

(D) TOPOIOOT: linear 

(ii) MOLBCUI£ TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic ENA" 

(iii) HYPOTHETICAL: NO 



(ix) FEATORE: 

(A) NAME/KEY: CDS 

(B) LOCATICN: 9. .2564 

(D) oraER INFORMATION: /note= "Maize optimized sequence 
encoding VIPlA(a) viith the Bacillus secretion signal renoved as 
contained in pCIB5526" 



(xi) SBQUE3SCE DESCRIPTION: SBQ ID NO: 35: 

GATOCACC ATG AAG ACC AAC CAG ATC AGC ACC ACC CAG AAG AAC CAG CAG 50 
Met Lys Thr Asn Gin He Ser Thr Ttir Gin Lys Asn Gin Gin 
825 ' 830 835 



AAG GAG ATG GAC CGC AAG QQC Crc CTG GQC TAG TAC TK: AAG QQC AA^ 
Lys Glu Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys 
840 845 850 



98 



GAC TTC AGC AAC CTG ACC ATG TTC GCC CCC ACG CGT GAG AGC ACC CTC 146 
Asfp Phe Ser Asn Leu Ihr Met Phe Ala Pro Thr Arg Asp Ser Ihr Leu 
855 860 865 

ATC TAG GAC CAG CAG ACC GCC AAC AAG CTG CTG GAC AAG AAG CAG CAG 194 
He Tyr Asp Gin Gin Thr Ala Asn Lys Leu Leu Asp Lys Lys Gin Gin 
870 875 880 

GAG TAC CAG AGC ATC CQC TOG ATC GGC CTS ATC CAG AGC AAG GAG ACC 242 
Glu Tyr Gin Ser He Arg Trp He Gly Leu He Gin Ser Lys Glu Thr 
885 890 895 

GGC GAC TIC ACC TTC AAC CTG AGC GAG GAC GAG CAG GCC ATC ATC GAG 290 
Gly Asp Phe Thr Phe Asn Leu Ser Glu Asp Glu Gin Ala He He Glu 
900 905 910 915 

ATC AAC GGC AAG ATC ATC AGC AAC AAG GGC AAG GAG AAG CAG GTC (?IG 338 
He Asn Gly Lys He He Ser Asn Lys Gly Lys Glu Lys Gin Val Val 
920 925 930 



CACCTGGAGAAGGGCAAGCTGGTG CCC ATC AAG ATC GAG TAC CAG AGC 



386 
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482 



530 



578 



626 



674 



His i^u Glu Lys Gly Lys Leu Val Pro lie Lys He Glu Tyr Gin Ser 

940 

GAC ACC MG ITC AAC ATC 

ASP Oto Lys Phe Asn He Asp Ser Lys Ihr *e Lys Glu S ^ 

955 

Phe Lys lie Asp Ser Gin Asn Gin Pro Gin Gin Val Gin ^ 
CTG CGC AAC CCC GAG^ AAC AaTaM GAG AGC 

Leu Arg Asn Pro Glu Phe Asn Lys Lys Glu Ser G& ^ ^ S 

990 

AAGOCTAGCAAGATCAACCTCTrcACCCftGCAGATCAflGCQCGar^'Tr' 
Lys Pro ser Lys lie Asn LeuIte^GlnSSS^S^SSJS 
1000 1005 1010 

GACGAGGACACCGACACCGACQGCGACAGCATCCCCGACCTOTOGGM 

ASP Glu ASP Ohr A^ Ihr A^ Gly A^ ser lie Pro S S S 
1015 1020 1025 

GAGAACQKTACACTATCCAGAACCXrATCGOCCTGAAGTCGGAC 
Glu Asn G y OVr Thr lie Gin Asn A^ He Ala Val Lys ^ ^ ^ 

1035 1040 

AO: era OCT AGC AAG GGC TAC AOC AAG TIC GIG AGC AAC CCC CTT rar 
ser Iju Ala Ser Lys Gly TVr Ihr Lys Phe Val S 

1050 1055 

AGCCACACTGTCGQCGACCCCTACACCGACTACGAGAAGGCEGrrfrv^ 

SerHisOhrValGlyAspProIVr^A^^SSJ^SS^S 
1065 1070 1075 

^S^f'^f^f^^^^AAGGAGAOCTTCAACCCCCIGGIGGCC 

Asp Leu Asp Leu Ser Asn Ala Lys Glu Uur Phe Asn Pro Leu Val Sa 
1080 1085 1090 

.Ala Phe Pro Val Asn Val Ser Met Glu Lys Val lie Leu Ser Pro 
1095 1100 1105 

AACGMAACCIGAGCAACAGCGTGGAGAGCCACTCGAGCACCAACTCG 
Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser ?S 
'■'■^^ 1115 1120 

AGCTACACCAACAOCGAGGGCQCCAGCGTCGAGGCCQGCATCQCTrrtV 
ser IVr Thr Asn Thr Glu Gly Ala Ser Val Glu S S ^ 
^^^^ 1130 1135 



770 



818 



866 



914 



962 



Thr 
1155 



' — «jv. oivj CAC AGC GAG ACT 

Lys Gly He Ser Phe Gly Val Ser Val Asn Tyr Gin His Ser Glu 55 
^^^^ 1145 1150 



1010 
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GTG GCC CAG GAG TCG GGC ACT AGC ACC GGC MC ACC AGC CAG 1058 
Val Ala Gin Glu Trp Gly Thr Ser Thr Gly Asn Thr Ser Gin Phe Asn 
1160 1165 1170 

ACC OCT AGC GCC GGC TAG CTG AAC GCC AAC GTC CXr TAG AAC A^^ 1106 
Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val 
1175 1180 1185 

QGC ACX GGC GCC ATC TAG GAG GTC AAG CCX: ACC AOC AGO TTC II54 
Gly Thr Gly Ala lie Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu 
1190 1195 1200 



1202 



1298 



AAC AAC GAC ACC ATC GCC ACC ATC-AOC GOC AAG TOG AAT TCC ACC GCC 
Asn Asn Asp Hir He Ala Thr He Thr Ala Lys Ser Asn Ser Ohr Ala 
1205 1210 1215 

CTGAACATCAGCCCCGGCGAGAGCTACCCCAAGAAGGGCa^AACGGC 1250 
Leu Asn He Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gin Asn Gly 
1220 1225 1230 1235 

ATCQCCATCACCAa:ATGGACGACTTCAACAGCCACCCCATCACCCTC 
He Ala He Thr Ser Met Asp Asp Hie Asn Ser His Pro He Thr Leu 
1240 1245 1250 

AACAAGAAGCAGGriGGACAACCTGCTGAACAACAAGCCCATCATCCro 1346 
Asn Lys Lys Gin Val Asp Asn Leu Lai Asn Asn Lys Pro Met Met Leu 
1255 1260 1265 

GAGACCAACCAGACCGACGGCGTCTACAAGATCAAGGACACCCACGGC 1394 
Glu Ihr Asn Gin Thr Asp Gly Val Tyr Lys He Lys Asp Thr His Gly 
1270 1275 1280 

AAC ATC GTC ACG GGC GGC GAG TOG AAC GOC GTG ATC GAG GAG ATC AAG 1442 
Asn He Val Thr Gly Gly Glu Trp Asn Gly Val He Gin Gin He Lys 
1285 1290 1295 

GCCAAGACCGCCAGCATCATCGTCGACGACGGCGAGCGCGIGGCCGAG 
Ala Lys Thr Ala Ser He He Val Asp Asp Gly Glu Arg Val Ala Glu 
1300 1305 1310 1315 

AAGOrGTGGCCGCCAAGGACTACGAGAACCCCGAGGACAAGACCCCC 1538 
Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro 
1320 1325 1330 

AGCCIGACCCIGAAGGACQCCCTGAAGCTGAGCTACCCCGACGAGATC 1586 
Ser Leu *nir Leu Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu He 
1335 1340 1345 

AAG GAG ATC GAG GGC TTG CTC TAG TAG AAG AAC AAG CCC ATC TAG GAG 1634 
Lys Glu He Glu Gly Leu Leu IVr Tyr Lys Asn Lys Pro He Tyr Glu 
1350 1355 1360 

AGO AGC GTG ATC ACC TAT CTA GAC GAG AAC ACC GCC AAG GAG C?IX3 ACC 1682 
Ser Ser Val Met Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr 
1365 1370 1375 



1490 
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AAG CAG CTG AAC GAC ACC ACC GGC AAG TTC AAG GAC GTC AGC CAC CTC 
Lys Gin Leu Asn Asp Hir Thr Gly Lys Phe Lys Asp Vai Ser His Leu 
1380 1385 1390 1395 

TAG GAC GTC AAG CTC ACC CCC AAG ATC AAC GTC ACC ATC AAG CIG AGC 
Tyr Asp Val Lys Leu Thr Pro Lys Met Asn Val 'Thr lie Lys Leu Ser 
1400 1405 1410 

ATC CTC TAG GAC AAC GCC GAG AGC AAC GAC AAC AGC ATC GGC AAG TCG 
He Leu Tyr Asp Asii_Ala Glu Ser Asn Asp Asn Ser He Gly LysJCrp - 
. 1415 1420 1425 

ACCAACACCAACATCGTCAQCGQC GQC AAC AAC GGC AAG AAG CAG TAC 
Thr Asn Thr Asn He Val Ser Gly Gly Asn Asn Gly Lys Lys Gin Tyr 
1430 1435 1440 

AGCAGCAACAACCCCGACGCCAACCTCACCCTCAACACCGACQCCCAG 
Ser Ser Asn Asn Pro Asp Ala Asn Leu I4ir Leu Asn Hir Asp Ala Gin 
1445 1450 1455 

GAG AAG CTC AAC AAG AAC CGC GAC TAC TAC ATC AGC CTC TAC ATC AAG 
Glu Lys Leu Asn Lys Asn Arg Asp lyr Tyr He Ser Leu Tyr Met Lys 
1460 1465 1470 1475 

AGC GAG AAG AAC ACC CAG TGC GAG ATC ACC ATC GAC GGC GAG ATA TAC 
Ser Glu Lys Asn Thr Gin Cys Glu He Ihr He Asp Gly Glu He Tyr 
1480 1485 1490 

COC ATC ACC AOC AAG ACC GTC AAC GTC AAC AAG GAC AAC TAC AAG CGC 
Pro He Thr Thr Lys Ihr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg 
1495 1500 1505 

CTC GAC ATC ATC GCC CAC AAC ATC AAG AGC AAC CCC ATC AGC AGC CIG 
Leu Asp He He Ala His Asn He Lys Ser Asn Pro He Ser Ser Leu 
1510 1515 1520 

CAC ATC AAG ACC AAC GAC GAG ATC ACC CTC TIC TOG GAC GAC ATA TCG 
His He Lys Thr Asn Asp Glu He Thr Leu Phe Trp Asp Asp He Ser 
1525 1530 1535 

ATTACCGACGTCGCCAGCATCAAGCCCGAGAACCTGAOCGACAGCGAG 
He Thr Asp Val Ala Ser He Lys Pro Glu Asn Leu Thr Asp Ser Glu 
1540 1545 1550 1555 

ATC AAG CAG ATA TAC AGT CGC TAC GGC ATC AAG CTC GAG GAC GGC ATC 
He Lys Gin He Tyr Ser Arg Tyr Gly He Lys Leu Glu Asp Gly He 
1560 1565 1570 

CTC ATC GAC AAG AAA GGC GQC ATC CAC TAC GGC GAG TTC ATC AAC GAG 
Leu He Asp Lys Lys Gly Gly He His Tyr Gly Glu Phe He Asn Glu 
1575 1580 1585 



GCC AGC TTC AAC ATC GAG CCC CTC CAG AAC TAC GTC ACC AAG TAC GAG 
Ala Ser Phe Asn He Glu Pro Leu Gin Asn Tyr Val Thr Lys Tyr Glu 
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1590 1595 1600 

GTG ACC TAC AGC AGC GAG CTG QGC CCC AAC GTG AGC GAC ACC CTC GAG 2402 
Val Thr Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu 
1605 1610 1615 

AGC GAC AAG ATT TAC AAG GAC GGC ACC ATC AAG TTC GAC TTC ACC AAG 2450 
Ser Asp Lys He Tyr Lys Asp Gly "Thr He Lys Phe Asp Phe Thr Lys 
1620 1625 1630 1635 

TACAGCAAGAACGAGCSVGQQCCTGTICTACGACAGCGQCCTGAACl^ 2498 
Tyr Ser Lys Asn Glu" Gin Gly Leu Phe Tyr Asp Ser Gly Leu AiiTTrp 
1640 _ 1645 1650 

GAC TIC AAG ATC AAC QCC ATC AO: TAC GAC GGC AAG GAG ATG AAC GTO 2546 
Asp Phe Lys He Asn Ala He Hir Tyr Asp Gly Lys Glu Met Asn Val 
1655 1660 1665 

TTC CAC CGC TAC AAC AAG TAGATCPGAG CT 2576 
Phe His Arg lyr Asn Lys 
1670 



(2) INFORMATION FOR SBQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTIH: 852 andno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 36: 

Met Lys Thr Asn Gin lie Ser Thr Ihr Gin Lys Asn Gin Gin Lys Glu 
1 5 10 15 

Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe 
20 25 30 

Ser Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu He Tyr 
35 40 45 

Asp Gin Gin Thr Ala Asn Lys Leu Leu Asp Lys Lys Gin Gin Glu Tyr 
50 55 60 

Gin Ser He Arg Trp He Gly Leu He Gin Ser Lys Glu Thr Gly Asp 
65 70 75 80 

Phe Ihr Phe Asn Leu Ser Glu Asp Glu Gin Ala He He Glu He Asn 
85 90 95 

Gly Lys He He Ser Asn Lys Gly Lys Glu Lys Gin Val Val His Leu 
100 105 110 
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Glu Lys Giy Lys Leu Vai Pro He Lys He Glu 'lyr Gin Ser Asp Thr 
115 120 125 

Lys Phe Asn He Asp Ser Lys Hir Phe Lys Glu Leu Lys Leu Phe Lys 
130 135 140 

He Asp Ser Gin Asn Gin Pro Gin Gin Val Gin Gin Asp Glu Leu Arg 
150 155 160 

Asn Pro Glu Phe Asn Lys Lys Glu Ser Gin Glu Kie Leu Ala Lys Pro 
165 170 175 

Ser Lys He Asn Leu Phe Thr Gin Gin MetrLys Arg Glu He Asp Glu 
180 185 190 

Asp Ihr Asp Thr Asp Gly Asp Ser He Pro Asp Leu Trp Glu Glu Asn 
195 200 205 

Gly lyr Thr He Gin Asn Arg He Ala Val Lys Trp Asp Asp Ser Leu 
210 215 220 

Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His 
225 230 235 240 

•nir Val Gly Asp Pro Tyr Ito Asp Tyr Glu Lys Ala Ala Arg Asp Leu 
245 250 255 

Asp Leu Ser Asn Ala Lys Glu Ihr Phe Asn Pro Leu Val Ala Ala Phe 
260 265 270 

Pro Ser Val Asn Val Ser Ifet Glu Lys Val He Leu Ser Pro Asn Glu 
275 280 285 

Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tvo: 
290 295 300 

Thr Asn Thr Glu Gly Ala Ser Val Glu Ala Gly He Gly Pro Lys Glv 
305 310 315 320 

He Ser Phe Gly Val Ser Val Asn Tyr Gin His Ser Glu 'Thr Val Ala 
325 330 335 

Gin Glu Trp Gly -nir Ser Thr Gly Asn Thr Ser Gin Phe Asn Thr Ala 
340 345 350 

Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr 
355 360 365 

Gly Ala He Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn 
370 375 380 

Asp Thr He Ala Thr He Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn 
385 390 395 400 

He Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gin Asn Gly He Ala 
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405 



410 



415 



lie llir Ser Met Asp Asp Phe Asn Ser His Pro lie l*hr Leu Asn Lys 
420 425 430 

Lys Gin Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Tta: 
435 440 445 

Asn Gin Hhr Asp Gly Val Tyr Lys lie Lys Asp Thr His Gly Asn He 

450 455 460 

Val Thr Gly Gly Glu Trp Asn Gly Val He Gin Gin He Lys Ala Lys 
465 470 475 480 

Hhr Ala Ser He He Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg 
485 490 495 

Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu 
500 505 510 

Thr Leu Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu He Lys Glu 
515 520 525 

He Glu Gly Leu Leu Tyr Tyr Lys Asn Lys Pro He Tyr Glu Ser Ser 

530 535 540 

Val Met Ihr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gin 
545 550 555 560 

Leu Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp 
565 570 575 

Val Lys Leu Thr Pro Lys Met Asn Val Thr He Lys Leu Ser He Leu 
580 585 590 

Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser He Gly Lys Trp Thr Asn 
595 600 605 

Thr Asn He Val Ser Gly Gly Asn Asn Gly Lys Lys Gin Tyr Ser Ser 
610 615 620 

Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gin Glu Lys 
625 630 635 640 

Leu Asn Lys Asn Arg Asp Tyr Tyr He Ser Leu Tyr Met Lys Ser Glu 
645 650 655 

Lys Asn Thr Gin Cys Glu He Thr lie Asp Gly Glu He Tyr Pro He 
660 665 670 

Thr Thr Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp 
675 680 685 



He He Ala His Asn He Lys Ser Asn Pro He Ser Ser Leu His He 
690 695 700 
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Lys Thr Asn Asp Glu He Thr Leu Wie Trp Asp Asp He Ser He Thr 
705 710 715 720 

Asp Val Ala Ser He Lys Pro Glu Asn Leu Thr Asp Ser Glu He Lys 
725 730 735 

Gin He Tyr Ser Arg Tyr Gly He Lys Leu Glu Asp Gly He Leu He 
740 745 750 

Asp Lys Lys Gly Gly He His Tyr Gly Glu Phe He Asn Glu Ala Ser _ 
755 760 765 

Phe Asn He Glu Pro Leu Gin Asn Tyr Val Thr Lys Tyr Glu Val Thr 
770 775 780 

Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp 
785 790 795 800 

Lys He Tyr Lys Asp Gly Ite He Lys Phe Asp Phe ISir Lys TVr Ser 
805 810 815 

Lys Asn Glu Gin Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe 
820 825 830 

Lys He Asn Ala He Ihr Oyr Asp Gly Lys Glu Met Asn Val Phe His 
835 840 845 

Arg Tyr Asn Lys 
850 

(2) INFORMATION F1QR SE3Q ID N0:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) lEiXmi: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDHMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "forward primer used to make 
PCIB5527" 

(iii) HYPOTHBTICAL: NO 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 37: 
QGATCCACCA TGCTGCAGAA CCTGAAGATC AC 
(2) INFORMATIOSr FOR SBQ ID NO: 38: 
(i) SEQUHSICE CHARACTERISTICS: 
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(A) LE3SJGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLEXrULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "reverse primer used to make 
PCIB5527" 

(iii) HYPOTOETICAL: NO 



(xi) SBQUElslCE DESCRIPTION: SBQ ID NO: 38: 
AACCTTCCAC TCCTTCTC 
(2) INFORMATION FOR SBQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMSTH: 1241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic ENA" 

(iii) HYPOTHETICAL: NO 



(ix) FEATORE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 1238 

(D) OTHER INFORMATION: /note= "Maize optiitdzed ENA 
sequence encoding VIP2A(a) with the Bacillus secretion signal 
removed as contained in pCIB5527" 



(xi) SBQUHSICE EMESCRIPITON: SBQ ID NO:39: 

GATCCACC ATGCTCCAGAACCTGAAGATCACCGACAAGGTOGAGGACTrc 
Met Leu Gin Asn Lys He Thr Asp Lys Val Glu Asp Phe 
855 860 865 

AAGGAGGACAAGGAGAAGQCCAAGGAGTGGQGCAAGGAGAAGGAGAAG 
Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys Glu Lys 
870 875 880 



50 



98 



GAGTGGAAGCTTACCGCCACCGAGAAGGGC AAG ATG AAC AAC TIC CTC 146 
Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn Phe Leu 
885 890 895 



GAC AAC AAG AAC GAC ATC AAG ACC AAC TAC AAG GAG ATC ACC TIC AGC 



194 
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Asp Asn Lys Asn Asp lie Lys Thr Asn Tyr Lys Glu lie Thr Phe Ser 
900 905 910 

ATA GCC GGC AGC TTC GAG GAC GAG ATC AAG GAC CIG AAG GAG ATC 242 
lie Ala Gly Ser Phe Glu Asp Glu He Lys Asp Leu Lys Glu He Asd 
915 920 925 930 

AAG ATG TTC GAC AAG ACC AAC CTG AGC AAC AGC ATC ATC ACC TAG AAG 
Lys Met E^e Asp Lys Thr Asn Leu Ser Asn Ser He He Thr Tyr Lys 
935 940 945 

AACGTGQAGCCCACCACCATCQGCTTCAACAAG P^CTG ACC GGC 33^ 
Asn Val Glu Pro Thr Ihr He Gly-ftie Asn Lys Ser Leu Thr Glu Gly 
950 955 960 

AACACCATCAACAGCGACGCCATGGCCCAGTrcAAGGAGCAGTrc 
Asn Thr He Asn Ser Asp Ala Met Ala Gin E*ie Lys Glu Gin Phe Leu 
965 970 975 

GAC CGC GAC ATC AAG TTC GAC AGC TAC CTG GAC ACC CTC ACC 434 
Asp Arg Asp He Lys Phe Asp Ser Tyr Leu Asp Thr His Leu Thr Ala 
980 985 990 



386 



482 



530 



626 



CAGCAGGTGAGCAGCAAGGAGCGCGTGATCCTGAAGGTCACCGTCCCC 
Gin Gin Val Ser Ser Lys Glu Arg Val He Leu Lys Val Thr Val Pro 
995 1000 1005 1010 

AGC GQC AAG GGCAGCACCAOCCCCACCAAGGCCGGCGTGATCCTCAAC 
Ser Gly Lys Gly Ser Thr Thr Pro Ihr Lys Ala Gly Val He Leu Asn 
1015 1020 1025 

AAC AGC GAG TAC AAG ATG CTG ATC GAC AAC GGC TAC ATG GIG CAC GIG 578 
Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Met Val His Val 
1030 1035 1040 

GAC AAG GIG AGC AAG GTGGTGAAGAAGQGCGTGGAGTGCCIGCAGATC 
Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gin He 
1045 1050 1055 

GAG GGC ACC CIG AAG AAG AGT CPA GAC TIC AAG AAC GAC ATC AAC QCC 674 
Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He Asn Ala 
1060 1065 1070 

GAG GCC CAC AGC TGG QGC ATG AAG AAC TAC GAG GAG TGG GCC AAG GAC 722 
Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala Lys Asp 
1075 1080 1085 1090 

CTG ACC GAC AGC CAG OGC GAG GCC CIG GAC GGC TAC GCC CGC CAG GAC 770 
l^u Thr Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Arg Gin Asp ' 
1095 1100 1105 

TAC AAG GAG ATC AAC AAC TAC CIG OCX: AAC CAG GGC GGC AGC GQC AAC 818 
Tyr Lys Glu He Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser Gly Asn 
1110 1115 1120 
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GAG AAG CTG GAC GCC CAG ATC AAG AAC ATC AGC GAC GCC CTC GGC AAG 866 
Giu Lys Leu Asp Ala Gin lie Lys Asn lie Ser Asp Ala Leu Gly Lys 
1125 1130 1135 

AAG CCC ATC CXXT GAG AAC ATC ACC GTG TAC CX3C TC3G TQC GGC ATG COC 914 
Lys Pro lie Pro Glu Asn He Hir Val Tyr Arg Trp Cys Gly ^4et Pro 
1140 1145 1150 

GAG TTC GGC TAC CAG ATC AGC GAC OCC CTG CCC AGC CTG AAG GAC TTC 962 
Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys Asp Phe 
1155 1160 1165_ 1170 

GAG GAG CAG TIC CPG^AAC ACC ATC^AAG GAG GAC AAG GGC TAC ATG AGC 1010 
Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly Tyr Met Ser 
1175 1180 1185 

ACC AGC CTG AGC AGC GAG CGC CIG GCC GCC TTC GGC AGC OGC AAG ATC 1058 
Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg Lys He 
1190 1195 1200 

ATC CTG CGC CTG CAG GTG CCC AAG GGC AGC ACT OGT GCC TAC CTG AGC 1106 
He Leu Arg Leu Gin Val Pro Lys Gly Ser Ihr Gly Ala Tyr Leu Ser 
1205 1210 1215 

GCC ATC GGC GGC TIC GCC AGC GAG AAG GAG ATC CTG CTG GAT AAG GAC 1154 
Ala He Gly Gly Phe Ala Ser Glu Lys Glu He Leu Leu Asp Lys Asp 
1220 1225 1230 

AGC AAG TAC CAC ATC GAC AAG GTG ACC GAG GTG ATC ATC AAG GGC GTG 1202 
Ser Lys TVr His He Asp Lys Val Thr Glu Val He He Lys Gly Val 
1235 1240 1245 1250 

AAG OGC TAC GTG GTG GAC GCC ACC CTG CTG ACC AAC TAG 1241 
Lys Arg Tyr Val Val Asp Ala Ttir Leu Leu Thr Asn 
1255 1260 



(2) INFORMATION FOR SBQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCSriH: 410 amino acids 

(B) TYPE: aitdno acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SBQUE^E DESCRIPTION: SBQ ID N0:40: 

Met Leu Gin Asn Leu Lys He Thr Asp Lys Val Glu Asp Phe Lys Glu 
1.5 10 15 

Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys Glu Lys Glu Trp 
20 25 30 

Lys Leu Thr Ala Thr Giu Lys Gly Lys Met Asn Asn Phe Leu Asp Asn 
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35 40 4'3 

Lys Asn Asp He Lys Thr Asn Tyr Lys Glu He Thr Phe Ser He Ala 
50 55 60 

Gly Ser Phe Glu Asp Glu He Lys Asp Leu Lys Glu He Asp Lys Met 
65 70 75 80 

Phe Asp Lys Thr Asn Leu Ser Asn Ser He He Thr Tyr Lys Asn Val 
85 90 95 

Glu Pro Thr Ohr He Gly Phe Asn I^^er Leu Thr Glu Gly Asn Thr 

100 105 — no 

He Asn Ser Asp Ala Met Ala Gin Phe Lys Glu Gin Phe Leu Asp Arg 
115 120 125 

Asp He Lys Phe Asp Ser TVr Leu Asp Ohr His Leu Thr Ala Gin Gin 
130 135 140 

Val Ser Ser Lys Glu Arg Val He Leu Lys Val Thr Val Pro Ser Gly 
145 150 155 160 

Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val He Leu Asn Asn Ser 
165 170 175 

Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Met Val His Val Asp Lys 
180 185 190 

Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gin He Glu Gly 
195 200 205 

Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He Asn Ala Glu Ala 
210 215 220 

His Ser Trp Gly Met Lys Asn lyr Glu Glu Trp Ala Lys Asp Leu Thr 
225 230 235 240 

Asp Ser Gin Arg Glu Ala Leu Asp Gly lyr Ala Arg Gin Asp 'Tyr Lys 
245 250 255 

Glu He Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser Gly Asn Glu Lys 
260 265 270 

Leu Asp Ala Gin He Lys Asn He Ser Asp Ala Leu Gly Lys Lys Pro 
275 280 285 

He Pro Glu Asn He Thr Val Tyr Arg Trp Cys Gly Met Pro Glu Phe 
290 295 300 

Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys Asp Kie Glu Glu 
305 310 315 320 

Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly Tyr Met Ser Thr Ser 
325 330 335 
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Leu Ser Ser 



Glu Arg Leu Ala Ala Phe Gly Ser Arg 
340 345 



Lys He He Leu 
350 



Arg Leu Gin 
355 



Val Pro Lys Gly Ser Thr Gly Ala TVr 
360 



Leu Ser Ala He 
365 



Gly Gly Phe 
370 



Ala Ser Glu Lys Glu He Leu Leu Asp 
375 380 



Lys Asp Ser Lys 



Tyr His He 
385 



Asp Lys Val ThriGlu Val He He Lys 
390 395 



Gly Val Lys Arg 
400 



TVr Val Val 



Asp Ala Thr Leu Leu Ihr Asn 
405 410 



(2) INFOPMATIQN FDR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LEWSIH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLEXM^ TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide encoding 
eukaryotic secretion signal used to construct pCIB5527*' 

(iii) HYPOTHETICAL: NO 



(xi) SEQUElslCE DESCRIPTION: SEQ ID N0:41: 
QGATCCACCA TQQGCTGGAG CTOGATCTTC CTGTrCCTQC TGAQOGGOQC OGOQQQOGTG 60 
CACTQCCTGC AG 72 
(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEN3TH: 1241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MDLBCULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic ENA" 

(iii) HYPOMETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 9. .1238 

(D) OIHEl^ INFORMATION: /note= "Maize optimized CNA 
sequOTce encoding VIP2A{a) with the Bacillus secretion signal 
removed and the eukaryotic secretion signal inserted as 
contained in pCIB5528" 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 42: 

GATCCACX: ATG CTG CAG AAC CTG AAG ATC ADC GAG AAG GTIG GAG GAC TTC 50 
Met Leu Gin Asn Leu Lys He Olir Asp Lys Val Glu Asp Phe 
^ 415 420 — 

98 



146 



AAG 
Lys 
425 


GAG 
Glu 


GAC 
Asp 


AAG 
Lys 


GAG 
Glu 


AAG 
Lys 
430 


GCC 
Ala 


AAG 
Lys 


GAG 
Glu 


TOG 
TCP 


GGC 
Gly 
435 


AAG 
Lys 


GAG 
Glu 


AAG 
Lys 


GAG 
Glu 


AAG 
Lys 
440 


GAG 
Glu 


TGG 
Ttp 


AAG 
Lys 


CTT 
Leu 


ACC 
Thr 
445 


GOC 
Ala 


ACC 
Thr 


GAG 
Glu 


AAG 
Lys 


GGC 
Gly 
450 


AAG 
Lys 


ATG 
Met 


AAC 
Asn 


AAC 
Asn 


TIC 
Phe 
455 


CTG 
Leu 


GAC 
Asp 


AAC 
Asn 


AAG 
Lys 


AAC 
Asn 
460 


GAC 
Asp 


ATC 
He 


AAG 
Lys 


ACC 
Thr 


AAC 
Asn 
465 


TAC 
Tyr 


AAG 
Lys 


GAG 
Glu 


ATC 
He 


ACC 
Thr 
470 


TIC 
Phe 


AGC 
Ser 


ATA 
He 


GCC 
Ala 


GGC 
Gly 
475 


AGC 
Ser 


TIC 
Phe 


GAG 
Glu 


GAC 
Asp 


GAG 
Glu 
480 


ATC 
He 


AAG 
Lys 


GAC 
Asp 


CTG 
Leu 


AAG 
Lys 
485 


GAG 
Glu 


ATC 
He 


GAC 
Asp 


AAG 
Lys 


ATG 
Met 
490 


TTC 
Phe 


GAC 
Asp 


AAG 

Lys 


ACC 
Hut 


AAC 
Asn 
495 


CTG 
Leu 


AGC 
Ser 


AAC 
Asn 


AGC 
Ser 


ATC 
He 
500 


ATC 
He 


ACC 
'Thr 


TAC 
Tyr 


AAG 
Lys 


AAC 
Asn 
505 


GTG 
Val 


GAG 
Glu 


ax 

Pro 


ACC 


ACC 
Ihr 
510 


ATC 
He 


GGC 
Gly 


TTC 
Phe 


AAC 
Asn 


AAG 
Lys 
515 


AGC 
Ser 


CTG 
Leu 


ACC 


GAG 
Glu 


GGC 
Gly 
520 


AAC 
Asn 


ACC 
Hhr 


ATC 
He 


AAC 
Asn 


AGC 

Ser 
525 


GAC 
Asp 


GCC 
Ala 


ATG 
Met 


GCC 
Ala 


CAG 

Gin 
530 


TTC 
Phe 


AAG 
Lys 


GAG 
Glu 


CAG 
Gin 


TTC 
535 


CTG 
Leu 


GAC 
Asp 


CGC 
Arg 


GAC 
Asp 


ATC 
He 
540 


AAG 
Lys 


TTC 
Phe 


GAC 
Asp 


AGC 
Ser 


TAC 
lyr 
545 


CTG 
Leu 


GAC 
Asp 


ACC 
Ohr 


CAC 
His 


CTG 
Leu 
550 


ACC 
Thr 


GCC 
Ala 



194 



242 



290 



338 



386 



434 



CAG CAG GTG AGC AGC AAG GAG CGC GTG ATC CTG AAG GTG ACC GTC CCC 482 
Gin Gin Val Ser Ser Lys Glu Arg Val He Leu Lys Val Ttrr Val Pro 
555 560 565 

AGC GGC AAG GGC AGC ACC ACC CCC ACC AAG GOC GGC GTG ATC CTG AAC 530 
Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val He Leu Asn 
570 575 580 



AAC AGC GAG TAC AAG ATG CTG ATC GAC AAC GGC TAC ATG GTG CAC GTC 
Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Met Val His Val 



578 
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585 590 595 600 

GAC AAG GTG AGC AAG GTG GIG AAG AAG GGC GTG GAG TQC CTC CAG ATC 626 
Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gin He 
605 610 615 

GAG GGC ACC CTG AAG AAG AGT CTA GAC TTC AAG AAC GAC ATC AAC GCC 674 
Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He Asn Ala 
620 625 630 

GAG GCC CAC AGC TOG GGC ATG AAG AAC TAG GAG GAG TOG GCC AAG GAC 722 
Glu-Ala His Ser Trp Gly Met Lys Asn Tyr Glu Gia Trp Ala Lys Asp 
635 _ 640 645 

CTG ACC GAC AGC CAG CGC GAG GCC CTG GAC GGC TAG GCC OQC CAG GAC 770 
Leu Ito: Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Poog Gin Asp 
650 655 660 

TAC AAG GAG ATC AAC AAC TAC CTG CGC AAC CAG GGC GGC AGC GGC AAC 818 
Tyr Lys Glu He Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser Gly Asn 
665 670 675 680 

GAG AAG CTG GAC GCC CAG ATC AAG AAC ATC AGC GAC GCC CTG GGC AAG 866 
Glu Lys Leu Asp Ala Gin He Lys Asn He Ser Asp Ala Leu Gly Lys 
685 690 695 

AAG CCC ATC CCC GAG AAC ATC ACC GTG TAC OQC TGG TGC GGC ATG OOC 914 
Lys Pro He Pro Glu Asn He Thr Val Tyr Arg Trp Cys Gly Met Pro 
700 705 710 

GAG TIC GGC TAC CAG ATC AGC GAC CCC CTG CCC AGC CTG AAG GAC TTC 962 
Glu £^e Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys Asp Phe 
715 720 725 

GAG GAG CAG TTC CTG AAC ACC ATC AAG GAG GAC AAG GGC TAC ATG AGC 1010 
Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly Tyr Met Ser 
730 735 740 

ACC AGC CTG AGC AGCGAGCGCCTGGCCGCCTTCGGCAGCCGCAAGATC 1058 
Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Hie Gly Ser Arg Lys He 
745 750 755 760 

ATC CTG OGC CTG CAG GTG CCC AAG OQC AGC ACT GGT GCC TAC CTG AGC 1106 
He Leu Arg Leu Gin Val Pro Lys Gly Ser Thr Gly Ala Tyr Leu Ser 
765 770 775 

GCC ATC GGC GGC TTC GCC AGC GAG AAG GAG ATC CTG CTG GAT AAG GAC 1154 
Ala He Gly Gly Phe Ala Ser Glu Lys Glu He Leu Leu Asp Lys Asp 
780 785 790 

AQCAAGTACCACATCGACAAGGTGACCGAGGTGATCATCAAGGGCGTO 1202 
Ser Lys Tyr His He Asp Lys Val Thr Glu Val He He Lys Gly Val 
795 800 805 

AAG CGC TAC GTG GIG GAC GCC ACC CTG CTG ACC AAC TAG 1241 
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Lys Arg 'lyr Val Val Asp Ala Thr Leu Leu Thr Asn 
810 815 820 



(2) INFORMATION FOR SBQ It) NO: 43: 

(i) SBQUEJJCE CHARACTERISTICS: 

(A) LEXOGIH: 410 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

_ (ii) MOLBCUUE TYPE: protein 

(xi) SEQUENCE raSCRIPTICN: SBQ ID NO: 43: 

Met Leu Gin Asn Lea Lys He Thr Asp Lys Val Glu Asp Phe Lys Glu 
15 10 15 

Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys Glu Lys Glu Trp 
20 25 30 

Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn Phe Leu Asd Asn 
35 40 45 " 

Lys Asn Asp He Lys Thr Asn Tyr Lys Glu He Thr Phe Ser He Ala 
50 55 60 

Gly Ser Phe Glu Asp Glu He Lys Asp Leu Lys Glu He Asp Lys Met 
65 70 75 80 

Phe Asp Lys Thr Asn Leu Ser Asn Ser He He Thr Tyr Lys Asn Val 
85 90 95 

Glu Pro Thr Thr He Gly Phe Asn Lys Ser Leu Thr Glu Gly Asn Thr 
100 105 no 

He Asn Ser Asp Ala Met Ala Gin Phe Lys Glu Gin Phe Leu Asp Arg 

115 120 125 

Asp He Lys Phe Asp Ser Tyr Leu Asp Ihr His Leu Thr Ala Gin Gin 
130 135 140 

Val Ser Ser Lys Glu Arg Val He Leu Lys Val Thr Val Pro Ser Gly 
145 150 155 160 

Lys Gly Ser Thr 'Thr Pro Thr Lys Ala Gly Val He Leu Asn Asn Ser 
165 170 175 

Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Met Val His Val Asp Lys 
180 185 190 

Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gin He Glu Gly 
195 200 205 

Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He Asn Ala Glu Ala 
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210 215 220 

His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala Lys Asp Leu Tlir 
225 230 235 240 

Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Arg Gin Asp Tyr Lys 
245 250 255 

Glu lie Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser Gly Asn Glu Lys 
260 265 270 

Leu Asp Ala Gin lie Lys Asn lie Ser A^ Ala Leu Gly Lys 15^ Pro 

275 280 — 285 

He Pro Glu Asn He Thr Val lyr Arg Trp Cys Gly Met Pro Glu E^e 
290 295 300 

Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys Asp Phe Glu Glu 
305 310 315 320 

Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly Tyr Met Ser Thr Ser 
325 330 335 

Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg Lys He He Leu 
340 345 350 

Arg Leu Gin Val Pro Lys Gly Ser Thr Gly Ala Tyr Leu Ser Ala He 
355 360 365 

Gly Gly Phe Ala Ser Glu Lys Glu He Leu Leu Asp Lys Asp Ser Lys 
370 375 380 

Tyr His He Asp Lys Val Thr Glu Val He He Lys Gly Val Lys Arg 
385 390 395 400 

Tyr Val Val Asp Ala Thr Leu Leu Thr Asn 
405 410 

(2) mPWMkTlW FOR SBQ ID NO:44: 

(i) SBSUENCE CHARACTERISTICS: 

(A) l£tKmi: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEEl^ESS: single 

(D) TOPOLOGY: linear 

(ii) MDLBCULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc := "oligonucleotide encoding 
vacuolar targetting peptide \ised to construct pCIB5533" 

(iii) HYPOTHETTICAL: NO 
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(xi) SEQUEICE DESCRIPriON: SBQ ID NO:44: 
CCGCQGGOGT GCACTGCCTC AGCAGCAGCA GCITC3GCCGA CAGCAACCCC ATCCGCX3TCA 
CXDGACCGCGC OGCCAQCACC CTQCAG 
(2) INFCmiATION FOR SBQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC3TH: 1358 base pairs 

(B) TCPE: nucleic acid 

(C) STRANDEIKESS: single ^ _ — — 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) rasCRIPTIGN: /desc = "Synthetic ENA" 

(iii) HYPOTOETICAL: NO 



(ix) FEA3URB: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 1355 

(D) OTHER INFORMATION: /note= "Maize optimized VIP2A(a) 
with the Bacillus secretion signal removed and the vacuolar 
targetting signal inserted as contained in pCIB5533*' 



(xi) SEQUENCE DESCRIPTIC^I : SBQ ID NO: 45: 

GATCCACC ATGGGCTGGAGCTQGATCTTCCTGTTCCTCCTCAGCGGCCS^ 
Met Gly Trp Ser Trp He Phe Leu Phe Leu Leu Ser Gly Ala 

GCG GGC GTG CAC TQC 
Ala Gly Val His Cys 
425 

ATC OGC GTC ACC GAC 
He Ar:g Val Hu: Asp 
445 

ACC GAC AAG GTG GAG 
•ttir Asp Lys Val Glu 
460 

TGG GGC AAG GAG AAG 
Trp Gly Lys Glu Lys 
475 

GGC AAG ATG AAC AAC 
Gly Lys Met Asn Asn 
490 

TAC AAG GAG ATC ACC 



415 



420 



CTCAGCAGCAGCAGCTTCGCCGACAGCAACCCC 98 
Leu Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro 
430 435 440 

CGC GGC GCC AGC ACC CTG CAG AAC CTG AAG ATC 146 
Arg Ala Ala Ser Tte Leu Gin Asn Leu Lys He 

450 455 

GAC TTC AAG GAG GAC AAG GAG AAG GCC AAG GAG 194 
Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu 
465 470 

GAG AAG GAG TGG AAG CTT ACC GCC ACC GAG AAG 242 
Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys 
480 485 

TTC CTG GAC AAC AAG AAC GAC ATC AAG ACC AAC 290 
ttie teu Asp Asn Lys Asn Asp He Lys Ihr Asn 
495 500 



TTC AGC ATA GCC GGC AGC TTC GAG GAC GAG ATC 



338 
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lyr Lys Glu lie liir Phe Ser lie Ala Gly Ser Phe Glu Asp Glu He 
505 510 515 520 

AAG GAC CTG AAG GAG ATC GAC AAG ATG TIC GAC AAG ACC AAC CTG AGC 386 
Lys Asp Leu Lys Glu He Asp Lys Met Phe Asp Lys Thr Asn Leu Ser 
525 530 535 

AAC AGC ATC ATC ACC TAC AAG AAC CTTG GAG CCC ACC ACC ATC GGC TTC 434 
Asn Ser He He Tte Tyr Lys Asn Val Glu Pro Thr Thr He Gly Phe 
540 545 550 

AAC AAG AGC CTG ACC GAG GGC AHC ACC ATC AAC AGC-^AC GCC ATG GCC 482 
Asn-Lys Ser Leu Tfir Glu Gly Asn Thr He Asn Ser Asp Ala Met Ala 
555 560 565 

CAG TIC AAG GAG CAG TIC CTG GAC OQC GAC ATC AAG TTC GAC AGC TAC 530 
Gin Phe Lys Glu Gin Phe Leu Asp Arg Asp He Lys Phe Asp Ser Tyr 
570 575 580 

CTG GAC ACC CAC CIG ACC GCC CAG CAG GTG AGC AGC AAG GAG OQC GTG 578 
Leu Asp Thr His Leu Thr Ala Gin Gin Val Ser Ser Lys Glu Arg Val 
585 590 595 600 

ATC CTG AAG GTG ACC GTC CCC AGC GGC AAG GGC AGC ACC ACC CCC ACC 626 
He Leu Lys Val Thr Val Pro Ser Gly Lys Gly Ser Tte Thr Pro Thr 
605 610 615 

AAG GCC GGC GTG ATC CTG AAC AAC AGC GAG TAC AAG ATG CTG ATC GAC 674 
Lys Ala Gly Val He Leu Asn Asn Ser Glu Tyr Lys Met Leu He Asp 
620 625 630 

AAC GGC TAC ATG GTIG CAC GTC GAC AAG GTG AGC AAG GTG GTG AAG AAG 722 
Asn Gly Tyr Met Val His Val Asp Lys Val Ser Lys Val Val Lys Lys 
635 640 645 

GGC GTG GAG TQC CTC CAG ATC GAG GGC ACC CTG AAG AAG AGT CTA GAC 770 
Gly Val Glu Cys Leu Gin He Glu Gly Ihr Leu Lys Lys Ser Leu Asp 
650 655 660 

TTC AAG AAC GAC ATC AAC GCC GAG GCC CAC AGC TOG GGC ATG AAG AAC 818 
Phe Lys Asn Asp He Asn Ala Glu Ala His Ser Trp Gly MetJys Asn 
665 670 675 680 

TAC GAG GAG TOG GCC AAG GAC CTG ACC GAC AGC CAG CGC GAG GCC CTG 866 
Tyr Glu Glu Trp Ala Lys Asp Leu Thr Asp Ser Gin Arg Glu Ala Leu 
685 690 695 

GAC GGC TAC GCC CGC CAG GAC TAC AAG GAG ATC AAC AAC TAC CTG CGC 914 
Asp Gly Tyr Ala Arg Gin Asp lyr Lys Glu He Asn Asn Tyr Leu Arg 
700 705 710 

AAC CAG GGC GGC AGC GGC AAC GAG AAG CTG GAC GOC CAG ATC AAG AAC 962 
Asn Gin Gly Gly Ser Gly Asn Glu Lys Leu Asp Ala Gin He Lys Asn 
715 720 725 
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AlC AGC GAC GCC CTG GGC AAG AAG CCC ATC (XC GAG AAC ATC ACC GTC 1010 
lie Ser Asp Ala Leu Gly Lys Lys Pro lie Pro Glu Asn lie Thr Val 
730 735 740 



TAG CX3C TGG T3C GGC ATG CCC GAG TIC QGC TAG CAG ATC AGC 

Tyr Arg Trp Cys Gly Met Pro Glu Phe Gly Oyr Gin He Ser Asp Pro 

745 750 755 760 



1058 



CTGCXXTAGCCTCAAGGACTrcGAGGAGa^TTCCro noe 
Leu Pro Ser Leu Lys Asp Phe Glu Glu Gin Hie Leu Asn Thr He Lys 

765 71D 775 

GAG GAC AAG-GGC TAG ATC AGO ACC AGC CTG AQC AGC GftG CGC CTC GCC 1154 
Glu Asp Lys Gly Tyr Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala 
780 785 790 

GCC TTC GGC AGC OQC AAGATCATCCT30GCCTGCAGGTGCCCAAGQGC 1202 
Ala ttie Gly Ser Arg Lys He He Leu Arg Leu Gin Val Pro Lys Gly 
795 800 805 

AGC ACT QGT GCC TAG CTC AGC GCC ATC GGC GGC TTC GCC AGC GAG AAG 1250 
Ser rhr Gly Ala Tyr Leu Ser Ala He Gly Gly Phe Ala Ser Glu Lys 
810 815 820 

GAG ATC CTG CTG GAT AAG GAC AGC AAG TAC CAC ATC GAC AAG GTC ACC 1298 
Glu He Leu Leu Asp Lys Asp Ser Lys OVr His He Asp Lys Val Hhr 
825 830 835 840 

GAGGTCATCATCAAGGGCGTCAAGCGCTACGTCGTCGACGOCACCCTC 1346 
Glu Val He He Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu 
845 850 855 

CTC ACC AAC TAG 3^353 
Leu Thr Asn 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SBQUE2«:e CHARACTERISTICS: 

(A) LEN3TH: 449 amino acic3s 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Gly Trp Ser Trp He Phe Leu Phe Leu Leu Ser Gly Ala Ala Gly 
1 5 10 15 

Val His Cys Leu Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro He Arg 
20 25 30 

Val Thr Asp Arg Ala Ala Ser Thr Leu Gin Asn Leu Lys He Thr Asp 
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35 



40 



45 



Lys Val Glu Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly 
50 55 60 

Lys Glu Lys Glu Lys Glu Trp Lys Leu Tbr Ala Tbr Glu Lys Gly Lys 
65 70 75 80 

Met Asn Asn Phe Leu Asp Asn Lys Asn Asp lie Lys Thr Asn Tyr Lys 
85 90 95 

Glu He Thr Phe Ser He Ala Gly .Ser Phe Glu Asp Qlu-Ile Lys Asp 
100 105 - 110 

Leu Lys Glu He Asp Lys Met Phe Asp Lys Ihr Asn Leu Ser Asn Ser 
115 120 125 

He He Thr Tyr Lys Asn Val Glu Pro Thr Ihr He Gly Phe Asn Lys 
130 135 140 

Ser Leu Thr Glu Gly Asn Thr He Asn Ser Asp Ala Met Ala Gin Ete 
145 150 155 160 

Lys Glu Gin Phe Leu Asp Arg Asp He Lys PhB Asp Ser Tyr Leu Asp 
165 170 175 

Thr His Leu Thr Ala Gin Gin Val Ser Ser Lys Glu Arg Val He Leu 



Lys Val Thr Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala 
195 200 205 

Gly Val He Leu Asn Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly 
210 215 220 

Tyr Met Val His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val 
225 230 235 240 

Glu Cys I^u Gin He Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys 
245 250 255 

Asn Asp He Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn IV^^lu 
260 265 270 

Glu Trp Ala Lys Asp Leu Thr Asp Ser Gin Arg Glu Ala Leu Asp Gly 
275 280 285 

Tyr Ala Arg Gin Asp Tyr Lys Glu He Asn Asn Tyr Leu Arg Asn Gin 
290 295 300 

Gly Gly Ser Gly Asn Glu Lys Leu Asp Ala Gin He Lys Asn He Ser 
305 310 315 320 

Asp Ala Leu Gly Lys Lys Pro He Pro Glu Asn He Thr Val Tyr Arg 



180 



185 



190 



325 



330 



335 
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Trp Cys Gly Met Pro Glu t^e Gly Tyr Gin He Ser Asp Pro Leu Pro 

340 345 350 



Ser Leu Lys Asp Phe Glu Glu Gin Phe Leu Asn Tfhr lie Lys Glu Asp 
355 360 365 



Lys Gly Tyr Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala £*ie 
370 375 380 

Gly Ser Arg Lys He He Leu Arg L eu Gi n Val Pro Lys Gly Ser Thr 
—585 390 395 400 

Gly Ala Tyr Leu Ser Ala He Gly Gly Phe Ala Ser Glu Lys Glu He 
405 410 415 

Leu Leu Asp Lys hsg Ser Lys Tyr His He Asp Lys Val Thr Glu Val 
420 425 430 

He He Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr 
435 440 445 



Asn 



(2) INFORMATION FOR SEQ ID N0:47: 

(i) SBQUHWCE CHARACTERISTICS: 

(A) LE3^3TH: 16 amino acids 

(B) TYPE: andno acid 

(C) STRANIM:NESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOIHEnCAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: P^tide 

(B) IOCATICN: 1..16 

(D) OTOER INFORMATION: /note= "linker peptide for fusion 
of VIPlA(a) and VIP2A(a) used to construct pCIB5533" 



(xi) SBQUHCE DESCRIPTION: SEQ ID N0:47: 

Pro Ser Thr Pro Pro Thr Pro Ser Pro Ser Thr Pro Pro Thr Pro Ser 
1 5 10 15 

(2) INFORMATION FOR SEQ ID N0:48: 

Ii) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 66 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLEXIULE TYPE: other nucleic acid 

(A) DESCRIPTiaa: /desc = -ENA encoding linker peptide 
used to construct pCIB5533" 

(iii) HYPOEHETICAL: NO 



(xi) SE;XmCE DESCMPTICN: SBQ ID NO:48: 
CCCX3QQCCTr CTACTCCCCC AACTCCCTCT OCTAGCACGC OTCCGACftOC TAOCGATATC 60 
C3GATCC 66 
(2) INFORMATIGN FOR SBQ ID N0:49: 

(i) SEQUENCE CHARACTERISmCS : 

(A) LEN3TH: 4031 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ^ "Synthetic CNA" 

(iii) HYPOIHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 6.. 4019 

(D) OraER INFORMATION: /note= "Maize cptimized ENA 
sequeice encoding a VIP2A(a) - VIPlA(a) fusicMi protein as 
contained in pCIBSSSl" 



(xi) SEQUENCE DESCRIPTION: SBQ ID N0:49: 

GATCC ATC AAG OGC ATC GAG GGC AAG CTC TTC ATG GTC AGC AAG AAG 47 
Met Lys Arg Met Glu Gly Lys Leu ttie Met Val Ser Lys Lys 
450 455 460 

CTC GAG GTG GTG ACC AAG ACC GIG CTC CTG AGC ACC GTG TTC AGC ATC 95 
Leu Gin Val Val Utac Lys Thr Val Leu Leu Ser *nir Val Phe Ser He 
465 470 475 

AGCCTGCTGAACAACGftGGTGATCAAGGOCGAGCAGCTGAACATCAAC 143 
Ser Leu Leu Asn Asn Glu Val He Lys Ala Glu Gin Leu Asn He Asn 
480 485 490 495 

AGC GAG AGC AAG TAC ACC AAC CTC CAG AAC CTG AAG ATC ACC GAC AAG 191 
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Ser Gin Ser Lys lyr Thr Asn Leu Gin Asn Leu Lys He rhr Asp Lys 
500 505 510 

GTG GAG GAG TIC AAG GAG GAC AAG GAG AAG GCC AAG GAG TOG GQC AAG 239 
Val Glu Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys 
515 520 525 

GAG AAG GAG AAG GAG TGG AAG CTT ACXT GCC ACC GAG AAG GGC AAG ATC 287 
Glu Lys Glu Lys Glu Trp Lys Leu Ihr Ala Ihr Glu Lys Gly Lys Met 
530 535 540 

_A^ AAC TIC CTG GAC AAC AAG AAC GAC ATC AAG ACC AAC^AC AAG GAG 335 
Asn Asn Phe Leu Asp Asn Lys Asn Asp He Lys Thr Asn Tyr Lys Glu 
545 550 555 

ATC ACC TTC AQC ATA GGC GQC AGC TIC GAG GAC GAG ATC AAG GAC CTC 383 
He Thr Phe Ser He Ala Gly Ser ttie Glu Asp Glu He Lys Asp Leu 
560 565 570 575 

AAG GAG ATC GAC AAG ATG TIC GAC AAG ACC AAC CTS AGC AAC AQC ATC 431 
Lys Glu He Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser He 
580 585 -y 590 

ATC ACC TAC AAG AAC GTC GAG CCC ACC AOC ATC GGC TIC AAC AAG AGC 479 
He Ihr Tyr Lys Asn Val Glu Pro Thr Thr He Gly fte Asn Lys Ser 
595 600 605 

CTG ACC GAG GGC AAC AOC ATC AAC AGC GAC GCC ATG GCC CAG TIC AAG 527 
Leu Ihr Glu Gly Asn Ihr He Asn Ser Asp Ala Met Ala Gin Phe Lys 
610 615 620 

GAG CAG TTC CTG GAC OGC GAC ATC AAG TIC GAC AGC TAC CTC GAC ACC 575 
Glu Gin Phe Leu Asp Arg Asp He Lys Phe Asp Ser Tyr Leu Asp Ihr 
625 630 635 

CACCTGACCGCCCAGCAGGTGAGCAGCAAGGAGaSCGTGATCCIGAAG 623 
His Leu Thr Ala Gin Gin Val Ser Ser Lys Glu Arg Val He Leu Lys 
640 645 650 655 

GTCACCC?rcCCCAGCGGCAAGGQC AGC .ACC ACC CCC ACC AAG GCC GGC 671 
Val Thr Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly 
660 665 670 

GTC ATC CTC AAC AAC AQC GAG TAC AAG ATC CTC ATC GAC AAC GGC TAC 719 
Val He Leu Asn Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly lyr 
675 680 685 



ATC GTC CAC GTC GAC AAG GTC AQC AAG GTC CTIG AAG AAG GQC GTC GAG 767 
Met Val His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu 
690 695 700 

TGC CTC CAG ATC GAG GGC ACC CTC AAG AAG AGT CTA GAC TIC AAG AAC 815 
Cys Leu Gin He Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn 
705 710 715 
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GAC ATC AAC GCC GAG GCC CAC AGC TOG GQC ATC AAG AAC TAC GAG GAG 863 
Asp lie Asn Ala Giu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu 
720 725 730 735 

TGG GCC AAG GAC CTG ACC GAC AGC CAG CGC GAG GCC CTC GAC GGC TAC 911 
Trp Alai Lys Asp Leu Ihr Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr 
740 745 750 

GCC CGC CAG GAC TAC AAG GAG ATC AAC AAC TAC CTG CGC AAC CAG GQC 959 
Ala Arg Gin Asp Tyr Lys Glu He Asn Asn Tyr Leu Arg Asn Gin Gly 
755 . 760 765 _ 

GQC AGC GGC AAC GAG AAG CTG GAC QCG-GAG ATC AAG^AAC ATC AGC GAC 1007 
Gly Ser Gly Asn Glu Lys Leu Asp Ala Gin He Lys Asn He Ser Asp 
770 775 780 

GCC CTG GGC AAG AAG CCC ATC CCC GAG AAC ATC ACC GTG TAC CGC TOG 1055 
Ala Leu Gly Lys Lys Pro He Pro Glu Asn He Thr Val Tyr Arg Trp 
785 790 795 

TQC GQC ATC CCC GAG TTC GQC TAC CAG ATC AGC GAC CCC CTG CCC AGC 1103 
Cys Gly Met Pro Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser 
800 805 810 815 

CTG AAG GAC TIC GAG GAG CAG TIC CTG AAC ACC ATC AAG GAG GAC AAG 1151 
Leu Lys Asp Phe Glu Glu Gin Phe Leu Asn Utar He Lys Glu Asp Lys 
820 825 830 

GGC TAC ATC AGC ACC AGC CTC AGC AGC GAG CGC CTC GCC GCC TTC GGC 1199 
Gly Tyr Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly 
835 840 845 

AGC CGC AAG ATC ATC CTC CGC CTC CAG GTC OCC AAG GGC AGC ACT GGT 1247 
Ser Arg Lys He He Leu Arg Leu Gin Val Pro Lys Gly Ser rflir Gly 
850 855 860 

GCC TAC CTC AGC GCC ATC GQC GGC TTC GCC AGC GAG AAG GAG ATC CTC 1295 
Ala Tyr Leu Ser Ala He Gly Gly Phe Ala Ser Glu Lys Glu He Leu 
865 870 875 

CTCGATAAGGACAQCAAGTACCACATCGACAAGGTCACCGAGGTCATC 1343 
Leu Asp Lys Asp Ser Lys Tyr His He Asp Lys Val Thr Glu Val He 
880 885 890 895 

ATC AAG GGC GTC AAG CGC TAC GTC GTC GAC QOC ACC CTC CTC ACC AAC 1391 
He Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn 
900 905 910 

TOC COG QGG CCT TCT ACT CCC CCA ACT CCC TCT CCT AGC ACQ CCT CCG 1439 
Ser Arg Gly Pro Ser Thr Pro Pro Thr Pro Ser Pro Ser Thr Pro Pro 
915 920 925 

ACA CCT AGC GAT ATC GGA TCC ACC ATC AAG ACC AAC CAG ATC AGC ACC 1487 
Thr Pro Ser Asp He Gly Ser Thr Met Lys Thr Asn Gin He Ser Thr 
930 935 940 
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ACC CAG AAG AAC CAG CAG AAG GAG ATG GAC CGC AAG GGC CTC CTC GGC 1535 
Thr Gin Lys Asn Gin Gin Lys Glu Met Asp Arg Lys Gly Leu Leu Gly 
945 950 955 

TAC TAC TIC AAG QGC AAG GAC TIC AGC AAC CIG ACC ATG TIC GCC CCC 1583 
Tyr TVr Phe Lys Gly Lys Asp Phe Ser Asn Leu Thr Met Phe Ala Pro 
960 965 970 975 

AOS CGT GAC AGC ACC CIG ATC TAC GAC CAG CAG ACC GCC AAC AAG CIG 1631 
Thr Arg Asp Ser Thr Leu He Tyr Asp Gin Gin Oto-Ala Asn Lys Leu 

980 -^985 990 - ^ 

CTGGftCAAGAAGCftGCAGGAGTACCAGAQCATCCGCTGGATCQGCCTC 1679 
Leu Asp Lys Lys Gin Gin Glu. Tyr Gin Ser He Arg Trp He Gly Leu 
99S 1000 1005 

ATC CAG AGC AAG GAG ACC GGC GAC TIC ACC TIC AAC CTG AGC GAG GAC 1727 
He Gin Ser Lys Glu Thr Gly Asp Phe Thr Phe Asn Leu Ser Glu Asp 
1010 1015 1020 

GAG CAG GCC ATC ATC GAG ATC AAC QGC AAG ATC ATC AGC AAC AAG GGC 1775 
Glu Gin Ala He He Glu He Asn Gly Lys He He Ser Asn Lys Gly 
1025 1030 1035 

AAG GAG AAG CAG GTG GTG CAC CTG GAG AAG GGC AAG CTG GTIG CCC ATC 1823 
Lys Glu Lys Gin Val Val His Leu Glu Lys Gly Lys Leu Val Pro He 
1040 1045 1050 1055 

AAG ATC GAG TAC CAG AGC GAC ACC AAG TIC AAC ATC GAC AGC AAG ACC 1871 
Lys He Glu Tyr Gin Ser Asp Thr Lys Phe Asn He Asp Ser Lys Thr 
1060 1065 1070 

TIC AAG GAG CTC AAG CTT TIC AAG ATC GAC AGC CAG AAC CAG CCC CAG 1919 
Phe Lys Glu Leu Lys Leu Phe Lys He Asp Ser Gin Asn Gin Pro Gin 
1075 1080 1085 

CAG GTC CAG CAG GAC GAG CTG CGC AAC CCC GAG TIC AAC AAG AAG GAG 1967 
Gin Val Gin Gin Asp Glu Leu Arg Asn Pro Glu Phe Asn Lys Lys Glu 
1090 1095 1100 

AGC CAG GAG TIC CTG GCC AAG CCC AGC AAG ATC AAC CTG TTC ACC CAG 2015 
Ser Gin Glu Phe Leu Ala Lys Pro Ser Lys He Asn Leu Kie Thr Gin 
1105 1110 1115 

CAG ATC AAG CGC GAG ATC GACGAGGACACCGACACCGACGGCGACAGC 2063 
Gin Met Lys Arg Glu He Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser 
1120 1125 1130 1135 

ATC CCC GAC CTG TOG GAG GAG AAC GGC TAC ACC ATC CAG AAC CGC ATC 2111 
He Pro Asp Leu Trp Glu Glu Asn Gly Tyr Thr He Gin Asn Arg He 
1140 1145 1150 



GCC GTC AAG TOG GAC GAC AGC CTC OCT AGC AAG GGC TAC ACC AAG TTC 
Ala Val Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe 



2159 
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1155 1160 1165 

GTCAXAACCXXCTGGMAXCACACCGTCQK 2207 
Val Ser Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr tbr Asp 
1170 1175 1180 

TAC GAG AAG GCC GCC CGC GAC CIG GAC CTG AGC AAC GCC AAG GAG ACC 2255 
Tyr Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr 
1185 1190 1195 

TTC AAC ax CTG GTG GCC GCC TTC OCC AGC GTG AAC GTG AGC ATG GAG 2303 
Phe Asn Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val-Ser Met Glu 
1200 1205 1210 " 1215 

AAG GTG ATC CTG AGC CCC AAC GAG AAC CTG AGC AAC AGC GTG GAG AGC 2351 
Lys Val lie Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser 
1220 1225 1230 

CAC TCG AGC ACC AAC TGG AGC TAC ACC AAC ACC GAG GGC GCC AGC GTS . 2399 
His Ser Ser Thr Asn Trp Ser Tyr Ihr Asn Thr Glu Gly Ala Ser Val 
1235 1240 1245 

GAGGOCGGCATCQCTCCCAAGQGCATCAQCTrcGGCGTGAQCGTG^ 2447 
Glu Ala Gly He Gly Pro Lys Gly He Ser Ete Gly Val Ser Val Asn 
1250 1255 1260 

TAC CAG CAC AGC GAG ACC GTG GCC CAG GAG TGG GGC ACC AGC ACC GGC 2495 
Tyr Gin His Ser Glu Thr Val Ala Gin Glu Trp Gly Ihr Ser Thr Gly 
1265 1270 1275 

AAC ACC AGC CAG TIC AAC ACC GCC AGC GCC GGC TAC CTG AAC GCC AAC 2543 
Asn Thr Ser Gin Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn 
1280 1285 1290 1295 

GTG OGC TAC AAC AAC GTG GGC ACC GGC GCC ATC TAC GAC GIG AAG CCC 2591 
Val Arg Tyr Asn Asn Val Gly Ttir Gly Ala He Tyr Asp Val Lys Pro 
1300 1305 1310 

ACC ACC AGC TTC GTG CTG AAC AAC GAC ACC ATC GCC ACC ATC ACC GCC 2639 
Thr Thr Ser Phe Val Leu Asn Asn Asp Thr He Ala Thr He Thr Ala 
1315 1320 1325 

AAG TOG AAT TOC AOC GCC CTG AAC ATC AGC CCC GGC GAG AGC TAC CCC 2687 
Lys Ser Asn Ser Thr Ala Leu Asn He Ser Pro Gly Glu Ser Tyr Pro 
1330 1335 1340 

AAG AAG GGC CAG AAC GGC ATC GCC ATC ACC AGC ATG GAC GAC TIC AAC 2735 
Lys Lys Gly Gin Asn Gly He Ala He Thr Ser Met Asp Asp Phe Asn 
1345 1350 1355 

AGC CAC CCC ATC ACC CTG AAC AAG AAG CAG GTG GAC AAC CTG CTG AAC 2783 
Ser His Pro He Thr Leu Asn Lys Lys Gin Val Asp Asn Leu Leu Asn 
1360 1365 1370 1375 

AAC AAG CCC ATG ATG CTG GAG ACC AAC CAG ACC GAC GGC GTC TAC AAG 2831 
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Asn Lys Pro Met Met Leu Glu Thr Asn Gin Thr Asp Gly Val Tyr Lys 
1380 1385 1390 

ATC AAG GAG ACC CAC QGC AAC ATC GTC /MDG QGC QGC GAG TX^ 2879 
He Lys Asp Thr His Gly Asn He Val Thr Gly Gly Glu Trp Asn Gly 
1395 1400 1405 

GOG ATC GAG GAG ATC AAG GOC AAG ACC GCr AGC ATC ATC GTC GAC GAC 2927 
Val He Gin Gin He Lys Ala Lys •flir Ala Ser He He Val Asp Asp 
1410 1415 1420 

QGCGAGOGCGTCGCrGAGAAGCGCGTGGCCGCC MG^XSPCT^ GAG AAC 2975 
Gly Glu-Tkrg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn 
1425 1430 1435 

CCCGAGGACAAGACCCCCAGCCTCACCCTGAAGGACGCCCTCAAGCTC 
Pro Glu Asp Lys Thr Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu 
1440 1445 1450 1455 

AGC TAG CCC GAC GAG ATC AAG GAG ATC GAG QOC TTC CTG TAG TAG AAG 3071 
Ser TVr Pro Asp Glu He Lys Glu He Glu Gly Leu Leu Tyr Tyr Lys 
1460 1465 1470 

AAC AAG OCC ATC TAG GAG AGO AQC GTC ATG AOC TAT CTA GAC GAG AAC 3119 
Asn Lys Pro He TVr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu Asn 
1475 1480 1485 

ACCGCCAAGGAGGTCACCAAGCAGCTCAACGACACCACCGQC 3167 
Thr Ala Lys Glu Val Thr Lys Gin Leu Asn Asp Thr Thr Gly Lys Phe 
1490 1495 1500 

AAGGACGTGAGCCACCTGTACGACGTOAAGCTGACCCCCAAGATGAAC 3215 
Lys Asp Val Ser His Leu Tyr Asp Val Lys Leu Thr Pro Lys Met Asn 
1505 1510 1515 

GTG ACC ATC AAG CTG AGO ATC CTG TAG GAC AAC GCC GAG AGC AAC GAC 3263 
Val Thr He Lys Leu Ser He Leu Tyr Asp Asn Ala Glu Ser Asn Asp 
1520 1525 1530 1535 

AAC AQC ATC QGC AAG TOG AOC AAC ACC AAC ATC GTG AGC GGC GGC AAC 3311 
Asn Ser He Gly Lys Ttp Thr Asn Thr Asn He Val Ser Gly Gly "Ash 
1540 1545 1550 

AACGGCAAGAAGCAGTACAQCAGCAACAACCCCGACGCCAACCTGACC 3359 
Asn Gly Lys Lys Gin Tyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr 
1555 1560 1565 

CTG AAC ACC GAC GCC GAG GAG AAG CIG AAC AAG AAC CGC GAC TAG TAG 3407 
Leu Asn Thr A^ Ala Gin Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr 
1570 1575 1580 

ATC AGC CTG TAC ATG AAG AGC GAG AAG AAC ACC GAG TGC GAG ATC ACC 3455 
He Ser Leu Tyr Met Lys Ser Glu Lys Asn Thr Gin Cys Glu He Thr 
1585 1590 1595 
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ATC GAC C3GC GAG ATA TAC CCC ATC ACC ACC AAG ACC CTTG AAC GTC AAC 3503 
lie Asp Gly Glu He Tyr Pro He Thr Thr Lys Thr Val Asn Val Asn 
1600 1605 1610 1615 

AAG GAC AAC TAC AAG CXX: CIG GAC ATC ATC OCX: CAC AAC ATC 3551 
Lys Asp Asn Tyr Lys Arg Leu Asp He He Ala His Asn He Lys Ser 
1620 1625 1630 

AAC CCC ATC AGC AGC CTG CAC ATC AAG ACC AAC GAC GAG ATC ACC CIG 3599 
Asn Pro He Ser Ser Leu His He Lys Thr Asn Asp Glu He Ihr Leu 
1635 _ 1640 1645 

T3?e-TGG GAC GAC^A TOG ATT ACC GAC GTC GCT AGC ATC AAG CCC GAG 3647 
Phe Trp Asp Asp He Ser He Thr Asp Val Ala Ser He Lys Pro Glu 
1650 1655 1660 

AAC CTG ACC GAC AGC GAG ATC AAG CAG ATA TAC AGT OGC TAC QGC ATC 3695 
Asn Leu Thr Asp Ser Glu He Lys Gin He Tyr Ser Arg Tyr Gly He 
1665 1670 1675 

AAG CTG GAG GAC GQC ATC CTG ATC GAC AAG AAA QGC QGC ATC CAC TAC 3743 
Lys Leu Glu Asp Gly He Leu He Asp Lys Lys Gly Gly He His Tyr 
1680 1685 1690 1695 

GGCGAGTTCATCAACGAGGCCAGCTTCAACATCGAGCCCCTCCAGAAC 3791 
Gly Glu Phe He Asn Glu Ala Ser Phe Asn He Glu Pro Leu Gin Asn 
1700 1705 1710 

TAC GTG ACC AAG TAC GAG GTC ACC TAC AGC AGC GAG CTC QGC CCC AAC 3839 
Tyr Val Thr Lys Tyr Glu Val Ihr Tyr Ser Ser Glu Leu Gly Pro Asn 
1715 1720 1725 



GTC AGC GAC ACC CTC GAG AGC GAC AAG ATT TAC AAG GAC GQC ACC ATC 
Val Ser Asp Thr Leu Glu Ser Asp Lys He Tyr Lys Asp Gly Thr He 
1730 1735 1740 



GACAQCQQCCTCAACTQGGACTICAAGATCAACGOCATC AC^TAC GAC 
Asp Ser Gly Asn Trp Asp Phe Lys He Asn Ala He Tte Tyr Asp 
1760 1765 1770 1775 



CT 



12) INP0RMATIC3N FOR SBQ ID ^3D:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 amino acids 



3887 



AAG TTC GAC TTC ACC AAG TAC AGC AAG AAC GAG CAG QGC CTC TIC TAC 3935 
Lys Phe Asp Phe Tfar Lys Tyr Ser Lys Asn Glu Gin Gly Leu Phe Tyr 
1745 1750 1755 



3983 



QGC AAG GAG ATC AAC GTC TIC CAC CGC TAC AAC AAG TAGATCIGAG 4029 
Gly Lys Glu Met Asn Val Phe His Arg Tyr Asn Lys 
1780 1785 



4031 
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(B) TYPE: amino acid 
(D) TOP0UX3Y: linear 

(ii) MOLETULE TOPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 50: 

Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu Gin 
15 10 15 

Val Val Thr Lys 233r Val Leu Leu Ser Thr Val Phe Ser He Ser Leu 
20 30 

Leu Asn Asn Glu Val He Lys Ala Glu Gin Leu Asn He Asn Ser Gin 
35 40 45 

Ser Lys Tyr Thr Asn Leu Gin Asn Leu Lys He Ihr Asp Lys Val Glu 
50 55 60 

Asp Ete Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys 
65 70 75 80 

Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn 
85 90 95 

Phe Leu Asp Asn Lys Asn Asp He Lys Thr Asn Tyr Lys Glu He Ohr 
100 105 110 

Phe Ser He Ala Gly Ser Phe Glu Asp Glu He Lys Asp Leu Lys Glu 
115 120 125 

He Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser He He Thr 
130 135 140 

Tyr Lys Asn Val Glu Pro Thr Thr He Gly Phe Asn Lys Ser Leu Thr 
145 150 155 160 

Glu Gly Asn Thr He Asn Ser Asp Ala Met Ala Gin Phe Lys Glu Gin 
165 170 175 

Phe Leu Asp Arg Asp He Lys Phe Asp Ser Tyr-lseu Asp Thr His Leu 
180 185 190 

Ihr Ala GLn Gin Val Ser Ser Lys Glu Arg Val He Leu Lys Val Thr 
195 . 200 205 

Val Pro Ser Gly Lys Gly Ser Ihr Thr Pro Tlir Lys Ala Gly Val He 
210 215 220 

I^u Asn Asn Ser Glu Tyr Lys Met Leu He Asp Asn Gly Tyr Met Val 
225 230 235 240 

His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu 
245 250 255 
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Gln He Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp He 
260 265 270 

Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala 
275 280 285 

Lys Asp Leu Thr Asp Ser Gin Arg Glu Ala Leu Asp Gly Tyr Ala Arg 
290 295 300 

Gin Asp lyr Lys Glu He Asn Asn Tyr Leu Arg Asn Gin Gly Gly Ser 
305 310 315 320 

Gly Asn Glu lys Leu Asp Ala Gin He Lys Asn He Ser Asp Ala Leu 
325 330 335 

Gly Lys Lys Pro He Pro Glu Asn He Thr Val Tyr Arg Trp Cys Gly 
340 345 350 

Met Pro Glu Phe Gly Tyr Gin He Ser Asp Pro Leu Pro Ser Leu Lys 
355 360 365 

Asp Phe Glu Glu Gin Phe Leu Asn Thr He Lys Glu Asp Lys Gly Tyr 
370 375 380 

Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg 
385 390 395 400 

Lys He He Leu Arg Leu Gin Val Pro Lys Gly Ser Thr Gly Ala Tyr 
405 410 415 

Leu Ser Ala He Gly Gly Phe Ala Ser Glu Lys Glu He Leu Leu Asp 
420 425 430 

Lys Asp Ser Lys Tyr His He Asp Lys Val Thr Glu Val He He Lys 
435 440 445 

Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn Ser Arg 
450 455 460 

Gly Pro Ser Thr Pro Pro Thr Pro Ser Pro Ser Thr Pro Pro Thr Pro 
465 470 475 480 

Ser Asp He Gly Ser Thr Met Lys Thr Asn Gin He Ser Thr Thr Gin 
485 490 495 

Lys Asn Gin Gin Lys Glu Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr 
' 500 505 510 

Phe Lys Gly Lys Asp Phe Ser Asn Leu Thr Met Phe Ala Pro Thr Arg 
515 520 525 

Asp Ser Thr Leu He Tyr Asp Gin Gin Thr Ala Asn Lys Leu Leu Asp 
530 535 540 



Lys Lys Gin Gin Glu Tyr Gin Ser He Arg Trp He Gly Leu He Gin 
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545 



550 



555 



560 



Ser Lys Glu Thr Gly Asp Phe Thr Phe Asn Leu Ser Glu Asp Glu Gin 
565 570 575 

Ala He He Glu He Asn Gly Lys He He Ser Asn Lys Gly Lys Glu 
580 585 590 

Lys Gin Val Val His Leu Glu Lys Gly Lys Leu Val Pro He Lys He 
595 600 605 

Glu Tyr Gin Ser Asp Tlir Lys Phe Asn He Asp^er Lys Iter Phe Lys 
610 — 615 620 

Glu Leu Lys Leu Phe Lys He Asp Ser Gin Asn Gin Pro Gin Gin Val 
625 630 635 640 

Gin -Gin Asp Glu Leu Arg Asn Pro Glu Phe Asn Lys Lys Glu Ser Gin 
645 650 655 

Glu Phe Leu Ala Lys Pro Ser Lys He Asn Leu Phe Thr Gin Gin Met 
660 665 670 

Lys Arg Glu He Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser He Pro 
675 680 685 

Asp Leu Trp Glu Glu Asn Gly Tyr Thr He Gin Asn Arg He Ala Val 
690 695 700 

Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser 
705 710 715 720 

Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu 
725 730 735 

Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr Phe Asn 
740 745 750 

Pro Leu Val Ala Ala PhB Pro Ser Val Asn Val Ser Met Glu Lys Val 
755 760 . 765 

He Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His Ser 
770 775 780 

Ser Thr Asn Trp Ser Tyr Thr Asn Thr Glu Gly Ala Ser Val Glu Ala 
785 790 795 800 

Gly He Gly Pro Lys Gly He Ser Phe Gly Val Ser Val Asn Tyr Gin 
805 810 815 

His Ser Glu Thr Val Ala Gin Glu Trp Gly Thr Ser Thr Gly Asn Thr 
820 825 830 



Ser Gin Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg 
835 840 845 • 
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Tyr Asn Asn Val Gly Itir Gly Ala He Tyr Asp Val Lys Pro Thr Thr 
850 855 860 

Ser Phe Val Leu Asn Asn Asp Thr He Ala Thr He Thr Ala Lys Ser 
865 870 875 880 

Asn Ser l!hr Ala Leu Asn He Ser Pro Gly Glu Ser Tyr Pro Lys Lys 
885 890 895 

Glyjaln Asn Gly He Ala He Thr Ser Met Asp Asp Phe Asn Ser His 
900 905 910 

Pro He Thr Leu Asn Lys Lys' Gin Val Asp Asn Leu Leu Asn Asn Lys 
915 920 925 

Pro Met Met Leu Glu Thr Asn Gin Thr Asp Gly Val Tyr Lys He Lys 
930 935 940 

A^ Thr His Gly Asn He Val Tlir Gly Gly Glu Trp Asn Gly Val He 
945 950 955 960 

Gin Gin He Lys Ala Lys Thr Ala Ser He He Val Asp Asp Gly Glu 
965 970 975 

Arg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu 
980 985 990 

Asp Lys Thr Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu Ser Tyr 
995 1000 1005 

Pro Asp Glu He Lys Glu He Glu Gly Leu Leu Tyr lyr Lys Asn Lys 
1010 1015 1020 

Pro He Tyr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu Asn Ihr Ala 
1025 1030 1035 1040 

Lys Glu Val Thr Lys Gin Leu Asn Asp Tbr Thr Gly Lys Phe Lys Asp 
1045 1050 1055 

Val Ser His Leu Tyr Asp Val Lys Leu Ttac Pro Lys Met Asn Val Thr 
1060 1065 1070 

He Lys Leu Ser He Leu Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser 
1075 1080 1085 

lie Gly Lys Trp Thr Asn Thr Asn He Val Ser Gly Gly Asn Asn Gly 
1090 1095 1100 

Lys Lys Gin Tyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn 
1105 1110 1115 1120 

Thr Asp Ala Gin Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr lie Ser 
1125 1130 1135 
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Leu Tyr Met Lys Ser Glu Lys Asn Thr Gin Cys Glu He Thr He Asp 
1140 1145 1150 

Gly Glu He Tyr Pro He Thr Thr Lys nix Val Asn Val Asn Lys Asp 
1155 1160 1165 

Asn Tyr Lys Arg Leu Asp He He Ala His Asn He Lys Ser Asn Pro 
1170 1175 1180 

He Ser Ser Leu His He Lys Thr Asn Asp Glu lie Thr Leu I^e Trp 
1185 1190 _ 1195 1200 

Asp A^ He Ser He Thr Asp Val Ala Ser He Lys Pro Glu Asn Leu 
1205 1210 1215 

Thr Asp Ser Glu He Lys Gin He Tyr Ser Arg Tyr Gly He Lys Leu 
1220 1225 1230 

Glu Asp Gly He Leu He Asp Lys Lys Gly Gly He His Tyr Gly Glu 
1235 1240 1245 

Phe He Asn Glu Ala Ser Phe Asn He Glu Pro Leu Gin Asn Tyr Val 
1250 1255 1260 

Thr Lys Tyr Glu Val Thr Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser 
1265 1270 1275 1280 

Asp Thr Leu Glu Ser Asp Lys He Tyr Lys Asp Gly Thr He Lys Phe 
1285 1290 1295 

Asp Phe Thr Tyr Ser Lys Asn Glu Gin Gly Leu Rie Tyr Asp Ser 
1300 1305 1310 

Gly Leu Asn Trp Asp Phe Lys He Asn Ala He Thr Tyr Asp Gly Lys 
1315 1320 1325 

Glu Met Asn Val Phe His Arg Tyr Asn Lys 
1330 1335 

(2) INPQRMATICK FOR SEQ ID NO: 51: 

(i) SEQUe«:E CHARACTERISTICS: 

(A) LENGTH: 2444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ENA (gaiomic) 
(iii) HYPOIHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 2444 
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(D) OTHEK INFXWMATION: /product= "3A(a) synthetic: native 

fusion- 



(xi) SBQUEMirE IXSCRIPriCN: SBQ ID N0:51: 

QGATCCACCA ATGAAC ATG AAC AAG AAC AAC ACC AAG CTG AGC ACC CGC 49 
Met Asn Lys Asn Asn Thr Lys Leu Ser 'Thr Arg 
15 10 

GCC CTG CCG AGC TIC ATC GAC TOO TTC AAC GQC ATC TAC GGC TTC GCC 97 
Ala Leu Pro Ser ^Kie lie Asp lyr I*ie Asn Gly Jle-lVr Gly Phe Ala 
15 20 25 

AOC GQC ATC AAG GAC ATC ATG AAC ATG ATC TTC AAG ACC GAC ACC GGC 145 
Thr Gly He Lys Asp He Met Asn Met He Hie Lys Thr Asp Ihr Gly 
30 35 40 

GGC GAC CPS ACC CTG GAC GAG ATC CTG AAG AAC CAG GAG CTC CIG AAC 193 
Gly Asp Leu Ohr Leu Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn 
45 50 55 

GAC ATC AGC GGC AAG CTG GAC GQC GTG AAC GGC AGC CTG AAC GAC CTC 241 
Asp He Ser Gly Lys Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu 
60 65 70 75 

ATC GCC CAG GGC AAC CTC AAC ACC GAG CTC AGC AAG GAG ATC CTT AAG 289 
He Ala Gin Gly Asn Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys 
80 85 90 

ATC GCC AAC GAG CAG AAC CAG GTG CTC AAC GAC GTC AAC AAC AAG CTC 337 
He Ala Asn Glu Gin Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu 
95 100 105 

GAC GCC ATC AAC AOC ATC CTC OGC GTC TAC CTC COG AAG ATC ACC AGC 385 
Asp Ala He Asn Ihr Met Leu Arg Val Tyr Leu Pro Lys He Thr Ser 
110 115 120 

ATC CTG AGC GAC CTG ATC AAG CAG AAC TAC GCC CTC AGC CTC CAG ATC 433 
Met Leu Ser Asp Val Met Lys Gin Asn Oyr Ala Leu Ser Leu Gin He 
125 130 135 

GAG TAC CTG AGC AAG CAG CTC CAG GAG ATC AGC GAC AAG CTC GAC ATC 481 
Glu Tyr Leu Ser Lys Gin Leu Gin Glu He Ser Asp Lys Leu Asp He 
140 145 150 155 

ATC AAC GTC AAC GTC CTC ATC AAC AGC AOC CTC ACC GAG ATC ACC COG 529 
He Asn Val Asn Val Leu He Asn Ser Ttir Leu Thr Glu He Ohr Pro 
160 165 170 

GOCTACCAGOGCATCAAGTACGTCAACGAGAAGTTC GAA GAG CTC ACC - 577 

Ala lyr Gin Arg He Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr 

175 180 185 

TIC GCC ACC GAG ACC AGC AGC AAG GTC AAG AAG GAC GGC AGC CCG GCC 625 



wo 97/46105 



PCT/EP97/02737 



- 140- 



Phe Ala l-hr Glu Ttu: Ser Scr Lys Val Lys Lys Asp Gly Ser Pro Ala 
190 195 200 

GAC ATC CTG GAC GAG CTG ACC GAG CTG ACT GAG CTC GCC AAG AGC GTO 673 
Asp lie Leu Asp Glu Leu «ir Glu Leu -rtir Glu Leu Ala Lys Ser Val 
205 210 215 

ACCAAGAACGBWrGTCGACGGCTTCGAGTrcTACCTGAACACCTTCCaC 721 
Thr Lys Asn Asp Val Gly Ehe Glu Phe Tyr Leu Asn -nir Phe His 
220 225 230 235 

GAC_CTG ATG GTO ^ AAC AAC CTC TKrGGC OSC AGO OCC CTC AAG_ACC '~T69 
Asp Val Met Val Gly Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr 
240 245 250 

GCCAGCGAGCTGATCAOCAflGGAGAACGTGAAGACCAGCQQCAQCGAG 817 
Ala Ser Glu Leu lie Thr Lys Glu Asn Val Lys Tiir Ser Gly Ser Glu 
255 260 265 

GTC QGC AAC GlGTACAACTTCCTCATCGTCCTCACrGCrCTCCAGGCC 
Val Gly Asn Val IVr Asn Hie Leu He Val Leu Thr Ala Leu Gin Ala 
270 275 280 

CAGQCTTICCIGACCCTGACCACCTGTaSCAAGCTCCTCGGCCrcGCC 913 
Gin Ala Phe I^u Thr Leu Hir Thr Cys Arg Lys L©J Leu Gly Leu Ala 
285 290 295 



GACATCGACTftCACCAGCATCATGAACQAGCACTTGAACAAGGAGAflG 
Asp He Asp Tyr Thr Ser He Met Asn Glu His Leu Asn Lys Glu Lvs 
300 305 310 315 

GAGGAGTICOGCGTGAACATCCTCCXJGACrCTGAGCAACACCTrcAQC 
Glu Glu PhB Arg Val Asn He Leu Pro Thr Leu Ser Asn Thr ttie Ser 
320 325 330 



865 



961 



1009 



AACCtGAACTACGCCAAGGTCAAGGQCAQCGACGftGGACQCCAAGATC 1057 
Asn Pro Asn Tyr Ala Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met 
335 340 345 

ATCGTGGAGGCTAAGCCGQQCCACGCGTroATCGGCTICGAGATCAQC 1105 
He Val Glu Ala Lys Pro Gly His Ala Leu He-Gly Phe Glu He Ser 
350 355 360 

AAC GAC AGC ATC ACC GfTO CTG AAG GTC TAG GAG GCX: AAG CTC AAG CAG 1153 
Asn Asp Ser He Thr Val Lai Lys Val Tyr Glu Ala Lys Leu Lys Gin 
365 370 375 

AACTACCAGGrrGGACAAGGACAGCTTGAQCGAGGTCATCTACGQCGAC 1201 
Asn lyr Gin Val Asp Lys Asp Ser Leu Ser Glu Val He Tyr Gly Asp 
380 385 390 395 

ATG GAC AAG CTt5 CTC TCT CXDG GAC CAG AGC GAG CAA ATC TAC TAC ACC 1249 
Met Asp Lys Leu Leu Cys Pro Asp Gin Ser Glu Gin lie Tyr Tyr Thr 
400 405 410 
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AAC AAC ATC GVG TIC CXX5 AAC GAG TAG GTC ATC ACQ AAG ATC GAG TTC 1297 
Asn Asn lie Val Phe Pro Asn Glu Tyr Val lie Thr Lys He Asp Phe 
415 420 425 

ACC AAG AAG ATG AAG ACC CTC CGC TAG GAG GTG ACC GCC AAC TTC TAG 1345 
Thr Lys Lys Met Lys Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe lyr 
430 435 440 

GACAGCAQCACCGQCGAGATCGACCIGAACAAGAAGAAGGTCG^ 1393 
Asp Ser Ser Thr Gly Glu He Asp Leu Asn Lys Lys Lys Val Glu Ser 

445 _ 450 455 - 

AGC GAG GCC GAG TAG OGC ACC CTG AQC GOG AAC GAG GAC QGC GTC TAG 1441 
Ser Glu Ala Glu Tyr Arg Ihr Leu Ser Ala Asn Asp Asp Gly Val lyr 
460 465 470 475 

ATG CCA CTG GGC GTG ATC AGC GAG ACC TTC CTG ACC CCG ATC AAC GQC 1489 
Met Pro Leu Gly Val lie Ser Glu Thr Phe Leu Thr Pro He Asn Gly 
480 485 490 

TIT QGC CTG CAG GCC GAC GAG AAC AGC OGC CTG ATC ACC CTG ACC TCT 1537 
Phe Gly Leu Gin Ala Asp Glu Asn Ser Arg Leu He Thr Leu Thr Cys 
495 500 505 

AAG AGC TAC CTG OGC GAG CTG CTG CTA GCC ACC GAC CTG AGC AAC AAG 1585 
Lys Ser Tyr Leu Arg Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys 
510 515 520 

GAG ACC AAG CTG ATC GTG CCA OCXS AOC QGC TTC ATC AGC AAC ATC GTG 1633 
Glu Thr Lys Leu He Val Pro Pro Ser Gly Phe He Ser Asn He Val 
525 530 535 

GAG AAC GGC AQC ATC GAG GAG GAC AAC CTG GAG COG TGG AAG GCC AAC 1681 
Glu Asn Gly Ser He Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn 
540 545 550 555 

AAC AAG AAC GCC TAC GTG GAC CAC ACC GQC GGC GIG AAC GGC ACC AAG 1729 
Asn Lys Asn Ala Tyr Val Asp His Thr Gly Gly Val Asn Gly Thr Lys 
560 565 570 

GCC CTG TAC GTG CAC AAG GAC GGC GGC ATC AQC-CAG TTC ATC GGC GAC 1777 
Ala Leu Tyr Val His Lys Asp Gly Gly He Ser Gin Phe He Gly Asp 
575 580 585 

AAG CTG AAG COG AAG ACC GAG TAC GTG ATC CAG TAC ACC CTG AAG GQC 1825 
Lys Leu Lys Pro Lys Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly 
590 595 600 

AAG CCA TOG ATT CAC CTG AAG GAC GAG AAC ACC QGC TAC ATC CAC TAC 1873 
Lys Pro Ser He His Leu Lys Asp Glu Asn Thr Gly TVr He His TVr 
605 610 615 

GAG GAC ACC AAC AAC AAC CTG GAG GAC TAC CAG ACC ATC AAC AAG OGC 1921 
Glu Asp Thr Asn Asn Asn Leu Glu Asp Tyr Gin Thr He Asn Lys Arg 
620 625 630 635 
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TTC ACC ACT GGC ACC GAC CTC AAG GGC GTC TAC CTG ATC CTC AAG AQC 1969 
Phe Thr Thr Gly Thr Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser 
640 645 650 

C»GAACQQCGACGAGGCX:TQGQQCGACAACTrcATCATCCTC^ 2017 
Gin Asn Gly Asp Glu Ala Ttp Gly Asp Asn Phe He He Leu Glu He 
655 660 665 

AGCCOGAGCGAGAAGCTGCTGAGCODGGAGCrc 2065 

Ser Pro Ser Glu Lys Leu Leu Ser Pro Glu Leu He Asn Thr Asn Asn 

670 675- 680 

TQGACCAGCACCGGCAQCACCAACATCAGCGCX:AACACrCTGAC^ 2113 
Trp Thr Ser 'Ehr Gly Ser Thr Asn He Ser Gly Asn 'Thr Leu Thr Leu 
685 690 695 

TAG CAG GGC GGC CGG GOG ATT CTA AAA CAA AAC CTT CAA TTA GAT AGT 2161 
Tyr Gin Gly Gly Arg Gly He Leu Lys Gin Asn Leu Gin Leu Asp Ser 
700 705 710 715 

TTT TCA ACT TAT AGA GTC TAT TIT TCT GTG TCXT GGA GAT OCT AAT GTA 2209 
Phe Ser Thr Tyr Arg Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val 
720 725 730 

AQG ATT AGA AAT TCT AQG GAA GTC TTA TTT GAA AAA AGA TAT ATC AGO 2257 
Arg He Arg Asn Ser Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser 
735 740 745 

GGT GCT AAA GAT G?IT TCT GAA ATG TTC ACT ACA AAA TTT GAG AAA GAT 2305 
Gly Ala Lys Asp Val Ser Glu Met Phe Thr Thr Lys Ehe Glu Lys Asd 
750 755 760 

AAC TIT TAT ATA GAG CTT TCT CAA GOG AAT AAT TTA TAT GGT GGT CCT 2353 
Asn Phe Tyr He Glu Leu Ser Gin Gly Asn Asn Leu Tyr Gly Gly Pro 
765 770 775 

ATT GTA CAT TTT TAC GAT GTC TCT ATT AAG NAA GAT CX3G GAT CTA ATA 2401 
He Val His Phe Tyr Asp Val Ser He Lys Xaa Asp Arg Asp Leu He 
780 785 790 795 " 

TTA ACA GTT TTT AAA AGC NAA TTC TIG TAT AAT GTC CTT GAT T 2444 
Leu Thr Val Kie Lys Ser Xaa Phe Leu Tyr Asn Val Leu Asp 
800 605 



(2) INFORMATION FOR SD3 ID N3:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Asn Lys Asn Asn Thr Lys Leu Ser TUnr Arg Ala Leu Pro Ser Phe 
15 10 15 

lie Asp Tyr Phe Asn Gly He Tyr Gly Phe Ala Ihr Gly He Lys Asp 
20 25 30 

He Met Asn Met He Phe Lys Ihr Asp Thr Gly Gly Asp Leu Thr Leu 
35 40 45 

Asp Glu He Leu Lys AsrTGln Gin Leu Leu-Asn Asp He Ser Gly Lys 
50 55 60 

Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 
65 70 75 80 

Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 

Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Thr 
100 105 110 

Met Leu Arg Val Tyr Leu Pro Lys He Thr Ser Met Leu Ser Asp Val 
115 120 125 

Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Leu Ser Lys 
130 135 140 

Gin Leu Gin Glu He Ser Asp Lys Leu Asp He He Asn Val Asn Val 
145 150 155 160 

Leu He Asn Ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin Arg He 
165 170 175 

Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr 
180 185 190 

Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp He Leu Asp Glu 
195 200 205 

Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
210 215 220 

Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly 
225 230 235 240 

Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
245 250 255 

Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr 
260 265 270 

Asn Phe Leu He Val Leu Thr Ala Leu Gin Ala Gin Ala Phe Leu Thr 

275 280 285 
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Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp He Asp lyr Thr 
290 295 300 

Ser He Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val 
305 310 315 320 

Asn He Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
325 330 335 

Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met He \^ Glu Ala Lys 
340 345 350 

Pro Gly His Ala Leu He Gly Phe Glu He Ser Asn Asp Ser He Ihr 
355 360 365 

Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin Val Asp 
370 375 380 

Lys Asp Ser Leu Ser Glu Val He Tyr Gly Asp Met Asp Lys Leu Leu 
385 390 395 400 

cys Pro Asp Gin Ser Glu Gin He Tyr Tyr Thr Asn Asn He Val Phe 
405 410 415 

Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Hhr Lys Lys Met Lys 
420 425 430 

Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
435 440 445 

Glu He Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu . Ala Glu TVr 
450 455 460 

Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
465 470 475 480 

He Ser Glu Thr Phe Leu Thr Pro He Asn Gly Phe Gly Leu Gin Ala 
485 490 495 

Asp Glu Asn Ser Arg Leu He Thr Leu Thr Cys Lys Ser Tyr Leu Arg 
500 505 510 

Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu He 
515 520 525 

Val Pro Pro Ser Gly Phe He Ser Asn He Val Glu Asn Gly Ser He 
530 535 540 

Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr 
545 550 555 560 

Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His 
565 570 575 
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Lys Asp Gly Gly He Ser Gin Phe He Gly Asp Lys Leu Lys Pro Lys 
580 585 590 

Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser He His 
595 600 605 

Leu Lys Asp Glu Asa Thr Gly Tyr He His oyr Glu Asp Thr Asn Asn 
610 615 620 

Asn Leu Glu Asp lyr Gin Thr He Asn Lys Arg Phe Thr llir Gly Ihr 
625 630 635 640 

Asp Leu Lys Gly ValJTyr Leu He Leu Lys Ser Gin Asn Gly Asp Glu 
645 650 655 

Ala Trp Gly Asp Asn Phe He He Leu Glu He Ser Pro Ser Glu Lys 
660 665 670 

Leu Leu Ser Pro Glu Leu He Asn Hir Asn Asn Trp Ihr Ser Thr Gly 
675 680 685 

Ser Thr Asn He Ser Gly Asn Thr Leu Thr Leu Tyr Gin Gly Gly Arg 
690 695 700 

Gly He Leu Lys Gin Asn Leu Gin Leu Asp Ser Phe Ser Thr Tyr Arg 
705 710 715 720 

Val lyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg He Arg Asn Ser 

725 730 735 

Arg Glu Val Leu Phe Glu Lys Arg lyr Met Ser Gly Ala Lys Asp Val 
740 745 750 

Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr lie Glu 
755 760 765 

Leu Ser Gin Gly Asn Asn Leu Tyr Gly Gly Pro He Val His Phe Tyr 
770 775 780 

Asp Val Ser He Lys Xaa Asp Arg Asp Leu He Leu Thr Val Phe Lys 
785 790 795 800 



Ser Xaa Phe Leu Tyr Asn Val Leu Asp 
805 
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(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : -tinear — 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pure maize optimized synthetic BT CrylA(b) gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
60 

GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
120 

AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
180 . 

GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
240 

GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
300 

GAGGGCCTGA GCAACCTGTA CCAGATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
360 
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CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC 
420 

CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
480 

TACGTOCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTGAGCGT GTTCGGCCAG 
540 

CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
600 

GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGC 
660 

CCCGACAGCC GCGACTGGAT CCGCTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
720 

CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
.780 

AGCCAGCTGA CCCGCGAGAT CTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
840 

CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
900 

AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
960 

ATCATGGCCA GCCCCGTGGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
1020 

ATGGGCAACG CCGCCCCCCA GCAGCGCATC GTGGCCCAGC TGGGCCAGGG CGTGTACCGC 
1080 
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ACCCTGAGCA GCACCCTGTA CCGCCGCCCC TTCAACATCG GCATCAACAA CCAGCAGCTG 
1140 

AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
1200 

TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCCCAGAA CAACAACGTG 
1260 — 

CCCCCCCGCC AGGGCTTCAG CCACCGCCTG AGCCACGTGA GCATGTTCCG CAGCGGCTTC 
1320 

AGCAACAGCA GCGTGAGCAT CATCCGCGCC CCCATGTTCA GCTGGATCCA CCGCAGCGCC 
1380 

GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
1440 

AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
1500 

CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
1560 

CAGCGCTACC GCGTGCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
1620 

ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
1680 

CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
1740 

AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC 
1800 
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CGCATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGCGCGCC 
1860 

CAGAAGGCCG TGAACGAGCT GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 
1920 

ACCGACTACC ACATCGACCA GGTGAGCAAC CTGGTGGAGT GCCTGAGCGA CGAGTTCTGC 
1980 — 

CTGGACGAGA AGAAGGAGCT GAGCGAGAAG GTGAAGCACG CCAAGCGCCT GAGCGACGAG 
2040 

CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCATCAACC GCCAGCTGGA CCGCGGCTGG 
2100 

CGCGGCAGCA CCGACATCAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAACTACGTG 
2160 

ACCCTGCTGG GCACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA GATCGACGAG 
2220 

AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
2280 

CTGGAGATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCACC 
2340 

GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC CCCATCGGCA AGTGCGCCCA CCACAGCCAC 
2400 

CACTTCAGCC TGGACATCGA CGTGGGCTGC ACCGACCTGA ACGAGGACCT GGGCGTGTGG 
2460 

GTGATCTTCA AGATCAAGAC CCAGGACGGC CACGCCCGCC TGGGCAACCT GGAGTTCCTG 
2520 
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GAGGAGAAGC CCCTGGTGGG CGAGGCCCTG GCCCGCGTGA AGCGCGCCGA GAAGAAGTGG 
2SB0 

CGCGACAAGC GCGAGAAGCT GGAGTGGGAG ACCAACATCG TGTACAAGGA GGCCAAGGAG 
2640 

AGCGTGGACG CCCTGTTCGT GAACAGCCAG TACGACCGCC TGCAGGCCGA CACCAACATC 

2700 — — ~ 

GCCATGATCC ACGCCGCCGA CAAGCGCGTG CACAGCATCC GCGAGGCCTA CCTGCCCGAG 
2760 

CTGAGCGTGA TCCCCGGCGT GAACGCCGCC ATCTTCGAGG AGCTGGAGGG CCGCATCTTC 
2820 

ACCGCCTTCA GCCTGTACGA CGCCCGCAAC GTGATCAAGA ACGGCGACTT CAACAACGGC 
2880 

CTGAGCTGCT GGAACGTGAA GGGCCACGTG GACGTGGAGG AGCAGAACAA CCACCGCAGC 

2940 

GTGCTGGTGG TGCCCGAGTG GGAGGCCGAG GTGAGCCAGG AGGTGCGCGT GTGCCCCGGC 
3000 

CGCGGCTACA TCCTGCGCGT GACCGCCTAC AAG6AGGGCT ACGGCGAGGG CTGCGTGACC 
3060 

ATCCACGAGA TCGAGAACAA CACCGACGAG CTGAAGTTCA GCAACTGCGT GGAGGAGGAG 
3120 

GTGTACCCCA ACAACACCGT GACCTGCAAC GACTACACCG CCACCCAGGA GGAGTACGAG 
3180 

GGCACCTACA CCAGCCGCAA CCGCGGCTAC GAC6GCGCCT ACGAGAGCAA CAGCAGCGTG 
3240 
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CCCGCCGACT ACGCCAGCGC CTACGAGGAG AAGGCCTACA CCGACGGCCG CCGCGACAAC 
3300 

CCCTGCGAGA GCAACCGCGG CTACGGCGAC TACACCCCCC TGCCCGCCGG CTACGTGACC 
3360 

AAGGAGCTGG AGTACTTCCC CGAGACCGAC AAGGTGTGGA TCGAGATCGG CGAGACCGAG 
3420 — 

GGCACCTTCA TCGTGGACAG CGTGGAGCTG CTGCTGATGG AGGAGTAGTA CATG 
3474 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iv) ANTI -SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Full length synthetic maize optimized BT CryIA{b) 
gene 

(xi) SEQUENCE DESCRIPTION: SEQ ID, N0T54 : 

GATCCAACAA TGGACAACAA CCCCAACATC AACGAGTGCA TCCCCTACAA CTGCCTGAGC 
60 

AACCCCGAGG TGGAGGTGCT GGGCGGCGAG CGCATCGAGA CCGGCTACAC CCCCATCGAC 
120 

ATCAGCCTGA GCCTGACCCA GTTCCTGCTG AGCGAGTTCG TGCCCGGCGC CGGCTTCGTG 
180 
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CTGGGCCTGG TGGACATCAT CTGGGGCATC TTCGGCCCCA GCCAGTGGGA CGCCTTCCTG 
240 

GTGCAGATCG AGCAGCTGAT CAACCAGCGC ATCGAGGAGT TCGCCCGCAA CCAGGCCATC 
300 

AGCCG.CCTGG AGGGCCTGAG CAACCTGTAXTCAAATCTACG CCGAGAGCTT CCGCGAGTGG 
360 

GAGGCCGACC CCACCAACCC CGCCCTGCGC GAGGAGATGC GCATCCAGTT CAACGACATG 
420 

AACAGCGCCC TGACCACCGC CATCCCCCTG TTCGCCGTGC AGAACTACCA ggtgcccctg 
460 

ctgagcgtgt acgtgcaggc cgccaacctg cacctgagcg tgctgcgcga cgtcagcgtg 

540 

TTCGGCCAGC GCTGGGGCTT CGACGCCGCC ACCATCAACA GCCGCTACAA CGACCTGACC 
600 

CGCCTGATCG GCAACTACAC CGACCACGCC gtgcgctggt acaacaccgg cctggagcgc 
660 

gtgtggggtc ccgacagccg cgactggatc aggtacaacc agttccgccg cgagctgacc 

720 

ctgaccgtgc tggacatcgt gagcctgttc cccaactacg acagccgcac ctaccccatc 

780 

CGCACCGTGA GCCAGCTGAC CCGCGAGATT TACACCAACC CCGTGCTGGA GAACTTCGAC 
840 

GGCAGCTTCC GCGGCAGCGC CCAGGGCATC GAGGGCAGGA TCCGCAGCCC CCACCTGATG 
900 
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GACATCCTGA ACAGCATCAC CATCTACACC GACGCCCACC GCGGCGAGTA CTACTGGAGC 
960 

GGCCACCAGA TCATGGCCAG CCCCGTCGGC TTCAGCGGCC CCGAGTTCAC CTTCCCCCTG 
1020 

TACGGCACCA TGGGCAACGC TGCACCTCAG CAGCGCATCG TGGCACAGCT GGGCCAGGGA 
1080 ^ _ . 

GTGTACCGCA CCCTGAGCAG CACCCTGTAC CGTCGACCTT TCAACATCGG CATCAACAAC 
1140 

CAGCAGCTGA GCGTGCTGGA. CGGCACCGAG TTCGCCTACG GCACCAGCAG CAACCTGCCC 

1200; 

AGCGCCGTGT ACCGCAAGAG CGGCACCGTG GACAGCCTGG ACGAGATCCC CCCTCAGAAC 
1260 

AACAACGTGC CACCTCGACA GGGCTTCAGC CACCGTCTGA GCCACGTGAG CATGTTCCGC 

1320 

AGTGGCTTCA GCAACAGCAG CGTGAGCATC ATCCGTGCAC CTATGTTCAG CTGGATTCAC 
1380 

CGCAGTGCCG AGTTCAACAA CATCATCCCC AGCAGCCAGA TCACCCAGAT CCCCCTGACC 
1440 

AAGAGCACCA ACCTGGGCAG CGGCACCAGC GTGGTGAAGG GCCCCGGCTT CACCGGCGGC 
1500 

GACATCCTGC GCCGCACCAG CCCCGGCCAG ATCAGCACCC TGCGCGTGAA CATCACCGCC 
1560 

CCCCTGAGCC AGCGCTACCG CGTCCGCATC CGCTACGCCA GCACCACCAA CCTGCAGTTC 
1620 
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CACACCAGCA TCGACGGCCG CCCCATCAAC CAGGGCAACT TCAGCGCCAC CATGAGCAGC 
1680 

GGCAGCAACC TGCAGAGCGG CAGCTTCCGC ACCGTGGGCT TCACCACCCC CTTCAACTTC 
1740 

AGCAACGGCA GCAGCGTGTT CACCCTGAGC GCCCACGTGT TCAACAGCGG CAACGAGGTG 

1800" _ " 

TACATCGACC GCATCGAGTT CGTGCCCGCC GAGGTGACCT TCGAGGCCGA GTACGACCTG 
1860 

GAGAGGGCTC AGAAGGCCGT GAACGAGCTG TTCACCAGCA GCAACCAGAT CGGCCTGAAG 
1920 

ACCGACGTGA CCGACTACCA CATCGATCAG GTGAGCAACC TGGTGGAGTG CCTGAGCGAC 
1980 

GAGTTCTGCC TGGACGAGAA GAAGGAGCTG AGCGAGAAGG TGAAGCACGC CAAGCGCCTG 

2040 

AGCGACGAGC GCAACCTGCT GCAGGACCCC AACTTCCGCG GCATCAACCG CCAGCTGGAC 
2100 

CGCGGCTGGC GCGGCAGCAC CGACATCACC ATCCAGGGCG GCGACGACGT GTTCAAGGAG 
2160 

AACTACGTGA CCCTGCTGGG CACCTTCGAC GAGTGCTACC CCACCTACCT GTACCAGAAG 
2220 

ATCGACGAGA GCAAGCTGAA GGCCTACACC CGCTACCAGC TGCGCGGCTA CATCGAGGAC 
2280 

AGCCAGGACC TGGAGATCTA CCTGATCCGC TACAACGCCA AGCACGAGAC CGTGAACGTG 
2340 
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CCCGGCACCG GCAGCCTGTG GCCCCTGAGC GCCCCCAGCC CCATCGGCAA GTGCGCCCAC 
2400 

CACAGCCACC ACTTCAGCCT GGACATCGAC GTGGGCTGCA CCGACCTCAA CGAGGACCTG 
2460 

GGCGTGTGGG TGATCTTCAA GATCAAGACC CAGGACGGCC ACGCCCGCCT GGGCAACCTG 

2520 

GAGTTCCTGG AGGAGAAGCC CCTGGTGGGC GAGGCCCTGG CCCGCGTGAA GCGCGCCGAG 
2580 

AAGAAGTGGC GCGACAAGCG CGAGAAGCTG GAGTGGGAGA CCAACATCGT GTACAAGGAG 
2640 

GCCAAGGAGA GCGTGGACGC CCTGTTCGTG AACAGCCAGT ACGACCGCCT GCAGGCCGAC 
2700 

ACCAACATCG CCATGATCCA CGCCGCCGAC AAGCGCGTGC ACAGCATTCG CGAGGCCTAC 
2760 

CTGCCCGAGC TGAGCGTGAT CCCCGGCGTG AACGCCGCCA TCTTCGAGGA GCTGGAGGGC 
2820 

CGCATCTTCA CCGCCTTCAG CCTGTACGAC GCCCGCAACG TGATCAAGAA CGGCGACTTC 
2880 

AACAACGGCC. TGAGCTGCTG GAACGTGAAG GGCCACGTGG ACGTGGAGGA GCAGAACAAC 
2940 

CACCGCAGCG TGCTGGTGGT GCCCGAGTGG GAGGCCGAGG TGAGCCAGGA GGTGCGCGTG 
3000 

TGCCCCGGCC GCGGCTACAT CCTGCGCGTG ACCGCCTACA AGGAGGGCTA C6GCGAGGGC 
3060 
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TGCGTGACCA TCCACGAGAT CGAGAACAAC ACCGACGAGC TCAAGTTCAG CAACTGCGTG 
3120 

GAGGAGGAGG TGTACCCCAA CAACACCGTG ACCTGCAACG ACTACACCGC CACCCAGGAG 
3180 

GAGTACGAGG GCACCTACAC CAGCCGCAAC CGCGGCTACG ACGGCGCCTA CGAGAGCAAC 
3240 — 

AGCAGCGTGC CCGCCGACTA CGCCAGCGCC TACGAGGAGA AGGCCTACAC CGACGGCCGC 
3300 

CGCGACAACC CCTGCGAGAG CAACCGCGGC TACGGCGACT ACACCCCCCT GCCCGCCGGC 
3360 

TACGTGACCA AGGAGCTGGA GTACTTCCCC GAGACCGACA AGGTGTGGAT CGAGATCGGC 
3420 

GAGACCGAGG GCACCTTCAT CGTGGACAGC GTGGAGCTGC TGCTGATGGA GGAGTAGTAC 
3480 

ATGTGATAGT ACGTAAGCTC GAGGATCT 3508 



(2) mFOPmrim for SBQ id N0:55: 

(i) SEQUENCE CMARACTERISTICS : 

(A) LENGnni: 1961 base pairs 

(B) TVPE: nucleic acid 

(C) OTRANDECNESS; single 

(D) TOEOLOGY: linear 

(ii) MOLECULE TYPE: CNA 
{ill) HYPOTHETICAL: NO 
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(iv) AOTI-SHOSE: NO 

(vi) ORIGINAL SOURCE: 

(A) CM3ANISM: Truncated synthetic maize optimized BT 
CrylA(b) gene 

{xi)-5BQUENCE DESGRIPITON: SEQ ID NO: 55: 

GATCCAACAA TCGACAACAA (XOCAACATC AA0GAGTC3CA TCOXTACAA CTGCCTGAGC 60 

AA00CXX3AGG TQGAGGrPGCT GQQOQQCJGAG CXXATOGAGA OCQGCTACAC CCCXATOCSAC 120 

aicagcx:tga ocxrroACxrA gitoctqctg asogagttog tgccxxsgoqc ooGcriasTO lao 

CTOQQCCTQG TQC3ACATCAT CTQOQQCATC TTOQQCCOCA GCCAGTOQGA OGCCnu:iU 240 

GTQCAGATCG AGCAGCTGAT CAACCAQOQC ATOGAGGAGT TQGCCOQCAA CCAQCXXMC 300 

AGCGGCCTGG AGQGCCTGAG CAACCTGTAC CAAATCTACG CCGAGAGCTT COQOGAGTGG 360 

GAGGCXXSACC CCACCAACCC CXSOCCrOOQC GMGAGATGC GCATCCACTTT CAACGACATC 420 

AACAG0C3CCC TGACCAOCQC CATXXOOCIG TTOGCCGrTOC AGAACTACCA CS G TOOCLVm 480 

CIGAGCX3TCT ACGTQCAGGC CX3CCAA0CTG CACCTCAGCG TCCTGOGOGA OCTCAGCGTC 540 

TTCGQCXIAGC GCTGGQQCTT CGAOSCOGCC ACCATCAACA GCOQCTACAA (XftOCTGACX: 600 

CQCCTGATCG GCAACTACAC CGACCAOSCX: GTOCQCTC3GT ACAACACCX3G CCTOGAQCGC 660 

GTCTOCSOGTC C0GACAG0CX3 OGACTOGATC AQC5TACAACC AGTrCCXSCCG CGAGCTCACC 720 

CIGAOOGTOC TQGACATOGrr GAGOCTGTIC CXXAACTAOS ACAGCXXX7C CTACCCCATC 780 

CGCACOSTGA QCCAGCTGAC CXXXXSAGATT TACACCAACC OOCntXTOGA GAACTTCXSAC 840 
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GGCAGCTTCC GCQGCAGOGC CCAGGGCATC GAGGGCAGCA TCCGCAGOCC CCACXTTGATC 900 

GACATCCTGA iCAGCATCAC CATCTACACC GACQCXXIACX: Q0GXX3GACTA CTACTOGAGC 960 

GGCX:::A0CAGA TCATQQC?CaG CJCCOGTOQGC TTCAGCQQCC (XX3AG1TCAC CTTCCCCCTC 1020 

TAOQQCACCA TOQQCAAOGC TQCACCICAG CAGCXXATCG TQC5CACA0CT GQGCCAGQGA 1080 

GTGTACCX3CA CXXTCftGCAG CAOCCICTAC OGTCXSACCTT TCAACATOSG'^TCAAC^ 1140 

CAGCAGCTGA GCXSTQCTOGA CX3GCACCGAG aTO3CCTACG QCACCAGCAG CAAOCTCCCC 1200 

ACXX3COGITC3T ACOaCAAGAG OQQCACCX?rG GACAGCXTOG ACGAGATCCC OCTTCAGAAC 1260 

AACAACGTQC CACCTQGACA GQQCTTCaGC CACCCTCTGA GCCACXSTGAG CATOTTCCGC 1320 

AGTOGCTTCA GCAACAGCAG CX3TCAGCATC ATCOSTOCAC CTATCTTCAG CTSGATTCAC 1380 

CX3CAGTQC0G AGTTCAACAA CATCATODCC AGCAGCCAGA TCACCCAGAT CCCOCTGACC 1440 

AAGAGCACCA ACCTQGGCAG CX3GCACCAGC GTQGTGAAQG GCCCCGGCT? CACOSGOGQC 1500 

GACATCOTGC QCaX»CCAG OCXXGQCCAG ATCAGCACCC TOOGCGTGAA CATCACCXXX: 1560 

CCXXrKSAGCC AQCX3CTACX3G CX3TC0GCATC CXCTAOSCCA GCA(X»CCAA CCroCAGTTC 1620 

CACACCAGCA TCGAOaCXXXS OOOCATCAAC OiQQQCAACT TCAGOQQCAC CATCAQCAGC 1680 

GQCAGCAACC TGCAGAQOGG CAQCTTOCXSC ACCX3TQQQCT TCACCAOXC CTICAACITC 1740 

AGCAACGGCA GCAQCXJTGTr O^CCCTCAGC GCCCAOGTGT TCAACAGCX3G CAAOGAGGTC 1800 

TACATCX3ACC GCATCGAGIT CGTQCXDOGCC GAGC3TSACCT TCGAGGCCGA GTAOGACCIG 1860 

GAGAGGQCTC AGAAGQCOGT GAACGAGCTG TICAOCAGCA GCAACXMAT CX3QCCTGAAG 1920 

ACCGACGTGA CXX3ACTACCA CATOGATCAG GTGTAGGAGC T 1961 
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DEPOSITS 

with the Agricultural Research Service. Patent Culture Collection (NRRL), Northern 
Regional Research Center, 1815 North University Street, Peoria, Illinois 61604. U.S.A. 

Strain designation Deposition Number Deposition Date 



Bacillus cereus AB78 
Escherichia co// pCIB4431 



NRRL B-21058 
NRRL B-18998 



March 18. 1998 
September 21. 1992 
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I5UI>Aj-|-::T TKI^ATY on TMH: 'NTEPNATHiN/vi. 
KKC:0GN1TIT0N OF Tlili DiaHiZJiT OF MlCROORCANKSM'J 
FOR THE rURPOSE OF PATE^rr PROCEDUKES 

INTERNATIONAL FORM 



TO 

Ciba«Geigy Corp. 

3054 Cornvallis Road 

Research Triangle Park. NC 27709 

NAME AND ADDRESS 

OF DEPOSITOR . 



RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 
issued pursuant co Rule 7.1 by che 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified ac the bottom of this page 



IDENTIFICATION OF THE MICROORGANISM 



Idenclficacion reference given by the 
DEPOSITOR: 

Escherichia eoli 
PCfiI4431 



Accession nuaber given by the 
INTERNATIONAL DEPOSITARY AUTHORITY: 

NRRL B-18998 



II. SCIENTIFIC DESCRIPTION AND/OR PROPOSED TAXONOMIC DESIGNATION 



The nicroorganism identified under I. above was accompanied by: 
Q a scientific description 

a proposed caxonomic designation 
(Mark with a cross vhere applicable) 



III, RECEIPT AND ACCEPTANCE 



This International Depositary Authority accepts the microorganism identified under I 
above, which was received by it on September 21, 1992 (date of the original deposit)* 



IV, RECEIPT OF REQUEST FOR CONVERSION 



The microorganism identified under I . above was received by this International 
Depositary Authority on ?f the original deposit) and 

a request to convert the original deposit to a deposit under the Budapest Treaty 
was received by it on (<iate of receipt of request for conversion) 



V, INTERNATIONAL DEPOSITARY AUTHORITY 



Name: Agricultural Research Culture 
Collection (NRRL) 
International Depositary Authority 
Address: 1815 N. University Street 

Peoria. Illinois 61604 U.S.A. 



Signature(s) of person(s) having the power 
to represent the International Depositary 
Authority or of authorized officiaKs): 



Date: 



aal Depo 

official 



Where Rule 6.4(d) applies, such date is the date on which the status of international 
depositary authority was acquired. 
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niji)Ari:sT trkaty o tiii- intkunationai. uecocmititon ok -P'^minrit 
OK MicROOiu;. ...ISMS Kou thk riJKi'o::^ or rATOr vhoc, juk 



INTERNATIOUAL TORM 



TO 

Ciba-Gelgy Corp. 

3054 Cornwallis Road 

Research Triangle Park. NC 27709 

SAME AND ADDRESS OF THE PARTY TO WHOM 
THE VIABILITY STATEMENT IS ISSUED 



VIABILITY STATHfEPr 

issued pursuanc to Rule 10.2 by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified ac the bottom of this page 



I. DEPOSITOR 



II. IDENTIFICATION OF THE MICROORGANISM 



Naae: 



Clba-G«lgy Corp. 



3054 Cornwallis Road 
Address: Research Triangle Park, NC 2770^ 



Depositor's taxonoaic designation and 
accession nunber given by the 
INTERNATIONAL DEPOSITARY AUTHORITY: 

Escherichia coli NRRL B-18998 

Date of: September 21. 1992^ 

T] * Original Deposit 
— 2 Mew Deposit 

^ Repropagation of Original Deposit 



III. (a) VIABILITY STATEMENT 



Deposit was found: Rn Viable □ Nonviable on September 23, 1992 (Date) 
International Depositary Authority's preparation was found viable on oeMb« 3. i982(Date)' 

III. (b) DEPOSITOR'S EQUIVALENCY DECLARATION 

Depositor determined the International Depositary Authority's preparation was 

[]] ^ Equivalent Q * Not equivalent to deposit on (Date) 



Signature of Depositor 



IV. CONDITIONS UNDER WHICH THE VIABIUTY TEST WAS PERFORMED (Depositors/Depositary)* 



INTERNATIONAL DEPOSITARY AUTHORITY 



Naae: Agricultural Research Culture 
Collection (NRRL) 
International Depositary Authority 
Address: 1815 N. University Street 

Peoria. Illinois 61604 U.S.A. 



Signature(5) of person(5) having the power 
to represent the International Depositary 
Authority or of authorized official(s): 



Date: 



lBdic«t» th« d«t« of tlM oritina dcpetlt or «h«i • n«w dApoalt hmn ma4m. 
Mark with • exo«a th« •ppLlcabU box. 

In th« c«t«s r«£omd to la RuLi 10.2(o>(ii) «ml CiU). ffr to iho soot mont. vlotoUUty tost, 
fill in it tho latotMtien h«o b««n roquostod. 
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BUDAPEST TRIATT ON TKB ZUrBUMTXqplKL-' 
20CHXTX0M or TMS D0OSZT OF NXWXibM 
POR THB PimPCStS nr PATBir PIIOCtDIIU 



iirrmuTioiUi. mm 



^ii^ r r ^ MCKPT IH THE CASK OF AN ORXCIMAL OBPOSXT 

Ciba-Ceigy Corp. ' purnunc eo Bm« 7.1 by thm 

305i Cornuallis Road mrCRMAT ZONAL OCPOSXTARY AUTIIOiiXTT 

Research Triangle Park. NC 27709 4C ca« boccoa o£ ttiia 



NAMK AMD ADDRESS 
Of DDOSXTOS 



I. IDWXrXCATXOH OP THI N! 



IZ8M 



Xdmeif icattofi r«f«r«nc« 91 van by eh* 

DEPOSXTOR: 

Bacillus cereus 
AB7fi 



Accaasion noaMr 9iVM by tto« 
XHmUUTZQIUa OCPOSXTARY AimiORXTy: 

NRRL B*2I05B 



XI. SCXBirXPXC DCSCRXPTXOM AMD/OR PROPOSI 



m TAXOMONXC OSSXGIUTZON 



Tti« aicroorgMiaa idaatititd wndmt I «bov« vat «eeo^ai«d byi 

' * acAvncific d«seciptien 

« >^ « propoiad tasonoaic dMi9n«tion 
(lUrit vacb A erott vfi«r« appiicabla) 



Xir. RKSZPT AMD ACC0TANCt 



lis intacaaciQaai D«poticary Authericy acnpca cha aieroorgRaiM IdMclCUd uii4«r I 
Ilea lias raeaivbd by It oaHarch 18.l993<4«ta ol tlia ocigiMl dtpotic)^ 



Th 

voica 



iVr-RBCKPT OP RIQUIST FOR COMVBISXON 



Tba «icreor9«iiit« idMititiad ua4mx X abova vas xacaivad by tbia lotatnatioMi 

Dapoauary Aucbority on <daea ct ch. ori9liial dapoait) and 

a raquaac to eonvart tba Qri9ia«i dapoait to a dapoait uiidar tha Budapaat Traaty 

vat taeaivad by it Ob <a.ta ot raeaipt ot tm^mMTtot cobvac.ioa) 



V. XirrSRNATXONAL OHPOSXTARY AUTKORXTT 



Naaa: .^grlculcural Research Culture 
Collection (NKRL) 
Incernacional Depositary Auchoric 

Addraasi ^^^^ University Street 
Peoria. Illinois 61604 U.S.A. 



Sl9n«cuxafa) of paraonfs) havin9 tba poMar 
to capraaaat tba Xataraatlottai Depositary 
Authority or oC autbofiisad o£Slciai(s): 

/■•='• ^ 

3 >w ^-^ 



Wbosa Rttia 6.4|d) appiias, aucb data ia cna data on wnicb tba status of lotarnatiobai danoaxf^r 
autboricy waa aoqairad. «*poBitar 
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E flBflgy C M TBE 
OF KtCKXIBMIXSIC TOT. 



OF msrr Rocsaoiii 



Dr. Rich Locictin 

Ciba*Gtlgy Core. 

3054 CornvAllis Road 

RasMreh Triuyie Park. !:C 27709 



1 



pyEEXSQLSQSBSflBr 



to Pule. 1 0.2 by ti» 
CERSZXKEOf WiUlUR 
on ttaa foUoiaiig 



NME MID AQOBESS OF IBE raER TO IKK 
igg VnBggy SIMBgRT IS ISSOB) | 



1. 



Clba-Ctify Carp. 




3054 ConwmlUs Road 




ftaaaareh triaaaXe ?arkt KC 27709 




Boellltta caraus WOL B-2lQn 















w fmnrtiip ^TiJtoto m ^tawlabl* on March 19. lyy i 



ttaa T iiiiii yi l lual mmasacy law Unity's peniBtiBn «h 
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What is claimed is : 

1. A method for protecting plants including progeny thereof against damage caused 
by Ostrinia furnacalis species comprising directly or indirectly applying to the plant or the 
plant seed or the growing area of the plant as an active ingredient a toxin protein of 
Bacillus species. 

2. A method according to claim 1, wherein the toxin protein is a toxin protein of Bacillus 
thuringiensis. 

3. A method according to claim 1. wherein the toxin protein is a toxin protein of Bacillus 
cereus, 

4. A method according to claim 1, wherein the toxin protein is a Cry-type protein, 
preferably a Cryl-type protein, more preferably a CrylA-type protein, and most preferably a 
CrylA(b) protein. 

5. A method according to claim 4. wherein the toxin protein is a Cryl-type protein 
according to SEQ ID Nos. 53 to 55. 

6. A method according to claim 1, wherein the toxin protein is a VIP-type protein, 
preferably a VIPI-type protein, such as a VIP1 A(a) protein or a VIP1 A(b) protein, or a VIP2- 
type protein, such as a VIP2A(a) protein or a VIP2A(b) protein or a combination of a VIPI- 
type protein and a Vll=*2-type protein. 

7. A method according to claim 1 , wherein the toxin protein is a VIP3-type protein, such 
as a VIP3A(a) protein or a VIP3A(b) protein. 

8. A method according to anyone of claims 6 to 7, wherein the toxin protein is a VIP-type 
protein according to SEQ ID Nos. 1, 2. 4-7. 17-24. 26-32, 35, 36. 39. 40, 42. 43, 45, 46. 49, 

50. 51 or 52. 
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9. The method according to daim 1 , wherein the toxin protein is applied to the plant in 
form of an entomocidal composition. 

10. The method according to claim 9, wherein the entomocidal composition comprises a 
microorganism. 

11. The method according to claim 1 or 9, wherein the toxin protein is a Cryl-type 
protein. 

12. The method according to claim 9, wherein the entomocidal composition comprises 
at least one Cryl-type toxin protein or a microorganism, preferably a Bacillus thuringiensis 
and/or a Bacillus cereus strain, containing at least one gene encoding said toxin protein 
together with a suitable carrier. 

13. The method according to claim 12, wherein the microorganism is a naturally-occurring 
organism containing at least one c/y/-type toxin gene encoding said toxin protein. 

14. A method according to claim 10, wherein the microorganism is a naturally-occurring 
organism containing at least one cry-type toxin gene, preferably a cryMype toxin gene, 
more preferably a c/yM-type toxin gene, and most preferably a crylA(b) toxin gene. 

15. The method according to claim 12, wherein the microorganism is a recombinant 
organism containing at least one c/yAtype toxin gene in recombinant fomi encoding said 
toxin protein. 

16. A method according to claim 10, wherein the microorganism is a recombinant 
organism containing at least one cry-type toxin gene, preferably a c/yMype toxin gene, 
more preferably a c/yM-type toxin gene, and most preferably a crylA(b) toxin gene. 
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17. The method according to claim 10. wherein the entomocidal composition 
comprises at least one VIP-type toxin protein or a microorganism, preferably a Bacillus 
thuringiensis and/or a Bacillus cereus strain, containing at least one gene encoding said 
toxin protein together with a suitable carrier. 

18. A method according to claim 10. wherein the microorganism is a naturally-occurring 
organism containing at least one VIP-type protein gene encoSing the said toxin-pfotein, 
preferably a VIP1 A(a) protein or¥viP1 A(b) protein gene or a VIP2-type protein gene such 
as as a VIP2A(a) protein or a VIP2A(b) protein gene. 

19. A method according to claim 10. wherein the microorganism is a naturally-occurring 
organism containing at least one VIP3-type protein gene such as as a VIP3A(a) protein or a 
VIP3A(b) protein gene encoding the said toxin protein. 

20. A method according to anyone of claims 17-19. wherein the toxin protein is a VIP-type 
protein according to SEQ ID Nos.1. 2. 4-7. 17-24, 26-32, 35. 36. 39. 40, 42. 43. 45. 46, 49. 
50, 51 or 52. 

21 . A method according to anyone of claims 11-16, wherein the toxin protein is a Cry-type 
protein according to SEQ ID Nos. 53 to 55. 

22. Anyone of claims 17 to 21 . wherein the microorganism is a recombinant organism. 

23. The method according to claim 1 to 22, wherein the toxin protein is indirectly 
applied to the plant, by transfonning said plant with a toxin gene encoding a toxin protein 
of Bacillus species and expressing said toxin protein in an amount sufficient to provide 
control against Ostrinia furnacalis {Asian Com Borer) species upon planting the so 
transformed plant in an area where said insect pest may occur. 

24. A method according to claim 23. wherein the toxin gene encodes a toxin protein of 
Bacillus tiiurirtgiensis or Bacillus cereus. 
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25. A method according to claim 23, wherein the toxin gene encodes a Cry-type toxin 
protein, preferably a Cryl-type protein, more preferably a CrylA-type protein, and most 
preferably a CrylA(b) protein such as those provided in SEQ ID Nos. 53 to 55, 

26. A method according to claim 23, wherein the toxin gene encodes a VIP-type toxin 
protein, preferably a VIPI-type protein, such as a VIP1A(a) protein or a VIPIA(b) protein or 
a VIP2-type protein, such as a VIP2A(a)^rotein or a VIP2A(b) protein. 

27. A method according to claim 23, wherein the toxin gene encodes a VIP3-type protein, 
such as as a ViP3A(a) protein or a VIP3A(b) protein. 

28. The method according to claim 23-27, wherein the toxin gene is a synthetic gene 
the codon usage of which is optimized by using the codons which are most preferred in 
plants. 

29. The method according to dalms 23 and 28, wherein the toxin gene encodes a Cryl- 
type protein. 

30. A method according to any one of claims 26 to 27, wherein the toxin gene encodes a 
VIP-type protein according to SEQ ID Nos.1. 2, 4-7, 17-24, 26-32. 35, 36. 39, 40, 42. 43. 
45, 46, 49, 50, 51 or 52. 

31. The method according to any one of claims 1 to 30. wherein the plant to be 
protected is a cereal plant. 

32. The method according to claim 31 . wherein the plant to be protected is a maize 
plant. 

33. A method according to any one of claims 1 to 32, wherein at least a Cry-tpye toxin 
protein is directly or indirectly applied to the plant or the plant seed or the growing area of 



WO97/4610S 



-168- 



PCT/EP97/02737 



the plant to be protected in combination with a VIP-type toxin protein in an amount sufficient 
to provide control against Asian Corn Borer {Ostrinia furnacalis) pests. 

34. The method according to any one of claims 1 to 32, wherein the active ingredient is 
a CrylA(b) protein. 

35. The method accordingjo^ny one of daims 1 to 32, whefeiirthe active ingredient is 
a VIP-type protein. 

36. Use of an active ingredient as defined in the previous claims for controlling Ostrinia 
furnacalis {Asian Com Borer) pests in crop plants. 

37. Use of a transgenic plant as defined in claim 23 for controlling Ostrinia furnacalis 
{Asian Com Borer) pests in crop plants. 

38. Use of recombinant microorganisms or transgenic plants according to anyone of claims 
23 to 30 comprising a DNA molecule which hybridizes to a cry-type or a VIP-type gene 
encoding the respective toxin protein under moderate stringent conditions for controlling 
Ostrinia furnacalis {Asian Com Borer) pests in crop plants. 

39. A commercial bag comprising seed of a transgenic plant comprising a toxin gene 
encoding a toxin protein of Bacillus tliuringiensis and expressing said toxin protein in an 
amount sufficient to provide control against Ostrinia furnacalis {Asian Corn Borer) 
species, together with lable instructions for the use thereof for control of Ostrinia 
furnacalis {Asian Com Borer) pests in crop plants. 

40. A commercial bag comprising seed of a transgenic plant or a microorganism, but 
especially a Bacillus thuringiensis and/or a Bacillus cereus strain, comprising at least a toxin 
gene encoding a toxin protein as defined previously in claims 1 to 30, 32 and 33, and 
expressing the said toxin protein in an amount sufficient to provide control against Ostrinia 
furnacalis {Asian Com Borer) species, together with lable Instmctlons for the use thereof for 
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control of Ostrinia fumacalis (Asian Com Borer) pests in crop plants. 

41 . An agricultural method, wherein a transgenic plant or the progeny thereof is used 
comprising a toxin gene encoding a toxin protein of a Bacillus species and expressing the 
said toxin protein in an amount sufficient to provide control against Ostrinia fumacalis 
(Asian Corn Borer) species or transgenic seed thereof, as defined in anyone of claims 23 
to 32. _ 

42. A method accroding to any one of claims 1 8 to 26, wherein at least a Cry-tpye toxin 
protein is expressed in the plant to be protected in combination with a VlP-type toxin protein 
in an amount sufficient to provide control against Sesamia pests. 

43. A composition comprising as an active ingredient at least a Cry-type toxin protein and 
a VIP-type protein in a insecticidally effective amount together with a agronomically 
acceptable carrier. 
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